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STUDIES OF DISTRIBUTED PRACTICE: VIII. 


LEARNING 


AND RETENTION OF PAIRED NONSENSE SYLLABLES 
AS A FUNCTION OF INTRALIST SIMILARITY 


BENTON J. UNDERWOOD! 


Northwestern University* 


The rate of learning paired-adjec- 
tive lists is unrelated to intertrial 
intervals up to 2 min. when lists are 
presented at a 2:2-sec. rate, i.e., 2 sec. 
for the stimulus alone and 2 sec. for 
the stimulus and response appearing 
together (8). With materials of low 
meaningfulness the data are somewhat 
contradictory. In one study Hovland 
(4) found that a 2-min. rest after each 
trial resulted in no faster learning than 
did a 6-sec. rest for paired nonsense 
syllables. In a subsequent study (5) 
however, the same conditions did pro- 
duce more rapid learning with spaced 
practice than with massed. Further- 
more, the differences were magnified 
if pairs were presented at a -1: l-sec. 
rate. No resolution of these conflict- 
ing data is available. 

Early studies by Hovland (e.g., 3) 
have consistently shown that in serial 
learning distributed practice facilitates 
acquisition. Previously in his writ- 


1R. L. Morgan and E. J. Archer supervised 
the gathering of the data; Mr. Archer and Jack 
Richardson are largely responsible for the statis- 
tical analysis. 

2 This work was done under Contract N7onr- 
45008, Project NR 154-057, between North- 
western University and The Office of Naval 
Research. 


ings, therefore, Hovland had theo- 
retically tied the phenomenon of 
facilitation by distribution to remote 
associations which are formed in serial 
learning. But, having found that 
facilitation will occur with spaced 
practice of paired nonsense syllables, 
his last theoretical position removes 
the tie to serial associations and pos- 
tulates what seems to be a work-inhi- 
bition theory. This theory (5) ap- 
pears to be one which simply states 
that the more work done per unit of 
time the greater the likelihood that 
distribution will facilitate learning. 
Considerable evidence could be mar- 
shalled in support of this position. If 
one is to hold to such a theory, how- 
ever, specification of the relationship 
between work and other variables 
should be suggested. One such vari- 
able is meaningfulness. That mean- 
ingfulness is in some way a variable in 
determining whether or not spaced 
practice facilitates is shown by the 
fact that learning of serial adjectives 
is not influenced as much by spaced 
practice as is learning of serial non- 
sense lists (9,12). Also, the learning 
of short verbal-discrimination lists of 
adjectives presented at a 1: 1-sec. rate 
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is not influenced by spaced practice 
(13). Of course, to handle such find- 
ings, one could merely postulate that 
the greater the meaningfulness, the 
less the work required. 

Another variable which may have 
to be related to work theory, or at 
least incorporated into some theory of 
distribution, is intratask similarity. 
In the first place, Hovland’s discard of 
the need for remote associations in 
learning before distribution will facili- 
tate learning is not easy to accept. 
One must first inquire into the basic 
processes which produce remote asso- 
ciations. One defensible position is 
that remote associations represent 
generalization tendencies. If this is 
so, and if generalization tendencies 
exist in paired-associate learning (as 
seems likely, e.g., 8), the quantitative 
continuity between serial and paired- 
associate learning can be maintained 
on the basis of the argument that no 
essentially different processes are rep- 
resented by the two forms of learning. 
It is thus still plausible that the key 
to a theory of distributed practice lies 
in number of generalization tendencies 
(produced by intratask similarity) and 
not in the amount of work. Again, 
however, work theory could simply 
state that the greater the generaliza- 
tion the greater the work. 

Accumulating evidence suggests 
that no simple theory of work inhibi- 
tion will be entirely satisfactory for 
both learning and retention phe- 
nomena associated with spaced prac- 
tice. It has been shown that while 
distribution of practice of paired- 
adjective lists does not affect acquisi- 
tion rates, differences in retention 
occur as a function of the intertrial 
rests during learning (8). In another 
study (11) intralist similarity was 
varied in serial nonsense lists. The 
results showed that facilitation oc- 
curred with distributed practice but 


this facilitation was no greater with 
lists of high similarity than with those 
of low similarity. If one did wish to 
postulate that work and intralist sim- 
ilarity are directly related, such evi- 
dence is difficult to handle. And in 
this same experiment, although intra- 
list similarity produced wide differ- 
ences in rate of learning, errors did 
not increase systematically with intra- 
list similarity as one would expect by 
generalization theory. In brief, we 
are short on acceptable theory to 
account for the few well-established 
facts we have and we are short on 
empirical laws of some generality 
describing relationships between inter- 
trial rest, related variables, and learn- 
ing-retention phenomena. 

The first purpose of the present 
experiments is to provide additional 
facts on the effect of intertrial rest in 
learning paired-associate nonsense 
lists. As pointed out above, Hov- 
land’s data are contradictory on this 
matter. The second purpose is to 
get information on the relationship 
between intralist similarity and inter- 
trial interval as they affect acquisition. 
The third purpose is to measure reten- 
tion as a function of the two variables. 


PROCEDURE 


General.—There were five separate experi- 
ments. Within each experiment (in which simi- 
larity is constant) there are three conditions of 
distribution, namely, 4, 30, and 60 sec. between 
each trial. Similarity differs among experiments, 
with intralist stimulus similarity and intralist 
response similarity varying independently. In 
Exp. I both stimulus similarity and response 
similarity arelow. In Exp. IIs intralist stimulus 
similarity is medium and response similarity low. 
In Exp. IIIs, intralist stimulus similarity is high 
and response similarity is low. In Exp. IIr, 
intralist response similarity is medium and stim- 
ulus similarity low. In Exp. IIIr, intralist 
response similarity is high, and stimulus similar- 
ity low. Actually, the same lists were used in 
Exp. IIr and IIIr as in IIs and IIIs, simply by 
“turning the lists over” so that what were 
formerly stimuli now became responses, and the 
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former responses now became stimuli. To 
summarize: 


Stimulus Response 
Experiment Similarity Similarity 
I Low Low 
IIs Medium Low 
IIIs High Low 
IIr Low Medium 
IlIr Low High 


Lists.—All lists were constructed of nonsense 
syllables of 46.67% to 53.33% association value 
according to Glaze (2). Each list consisted of 
ten pairs. In the above conditions low similarity 
means that there was no repetition of any con- 
sonant among stimulus (or response) items and 
no vowel was used more than three times. In 
medium similarity each of five letters was used 
twice as the first letter of a syllable and each 
of five different letters was used twice to end a 
syllable. In high similarity only three letters 
were used to start the ten syllables (two letters 
started three and one started four). Likewise, 
only three (although different) consonants were 
used to end syllables. In no case was a vowel 
(center letter) used more than three times in a 
list and commonly each occurred twice. Thus, 
for a low stimulus (or response) series, 20 con- 
sonants were used to make up the 10 items; for 
medium similarity 10 were used, and for high 
similarity, 6. To avoid stimulus-response gen- 
eralization similarity between stimuli and re- 
sponses was kept as low as possible. Obviously 
some repetition was unavoidable but the basic 
principle followed was that no letter used to 
start a syllable on the stimulus side was used 
to start a syllable on the response side. Repeti- 
tion did occur between the first letters of stimulus 
syllables and last letters of response syllables or 
between first letters of response syllables and 
last letters of stimulus syllables. Also, repetition 
between lists was kept as low as possible since 
each experiment required three lists, one for each 
distribution interval. Finally, one practice list 
(of medium similarity) was used for all five exper- 
iments. All lists were presented on a Hull-type 
drum at a 2:2-sec. rate with learning by the 
anticipation method, in which S spelled the 
syllable. Three orders of presentation were used 
to avoid serial learning. 

Specific conditions—Each experiment em- 
ployed 36 college students as Ss. Each S had 
one practice session and four experimental ses- 
sions, the last experimental session requiring 
only recall and relearning of the list learned the 
previous day. On the practice day S learned 
the list to 6 out of 10 correct responses on a single 
trial, the learning being by massed practice (4 
sec. between trials). He was then instructed 
concerning the rest-interval activity to be used 


to fill the intertrial rests during distributed prac- 
tice. This activity, symbol cancellation, has 
been explained in detail in a previous report (10). 
Following these instructions S continued learn- 
ing with a 30-sec. rest between each trial until 
one perfect trial was achieved. After 5 min., 
recall and relearning took place. On experi- 
mental days S learned three lists, one under each 
of the three intertrial rest conditions (4, 30, 60 
sec.), but, of course, only one list each day. 
Learning was always carried to one perfect reci- 
tation and recall and relearning (by massed prac- 
tice) occurred after 24 hr. A given list was 
always recalled and relearned before learning the 
list which in turn was recalled and relearned the 
following day. 

With three conditions and three lists, com- 
plete counterbalancing of both lists and condi- 
tions is achieved with 36 Ss. The groups of Ss 
were matched on the common practice list. 


RESULTS 


Practice list—The mean number of 
trials to learn the practice list was 
29.75, 29.06, 29.42, 28.72, and 30.33 
for Exp. I, IIs, IIIs, IIr, and IIIr, 
respectively. The variance is homo- 
geneous for the distributions on which 
these means are based and F is less 
than 1. For all groups combined the 
product-moment correlation between 
trials to learn the practice list and 
trials to learn the three experimental 
lists combined is .57+ .07. The 
mean number of errors per trial on the 
practice list was 1.23, 1.41, 1.07, 1.24, 
and 1.33 for the five groups in order. 
The F is 1.56 and with 4 and 175 df 
an F of 2.42 is needed for significance 
at the 5% level of confidence. The 
correlation between error frequency 
on practice list and on three experi- 
mental lists combined was .65 + .07. 
It may be concluded that the groups 
were initially comparable. 

Learning of experimental lists —The 
mean number of trials to learn the 
experimental lists for the three inter- 
trial rest intervals for all five experi- 
ments is shown in Table | and plotted 
in Fig. 1. The statistical analysis 
initially deals with Exp. I, IIs, and 
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TABLE 1 


Mean Triats To LEARN ParrED NONSENSE 
Lists as A FuncTIoN or STIMULUS AND 
Response SIMILARITY AND LENGTH 
or INTERTRIAL Rest* 














Intertrial Rest 

Exp. 
4 sec. 30 sec. 60 sec. 
I 23.97 22.42 23.33 
IIs 24.78 23.44 28.28 
IIIs 29.86 32.89 29.39 
IIr 24.19 21.42 21.50 
IIIr 25.22 23.69 24.27 














* Estimate of om is 1.30. 


IIIs independently, and then with 
Exp. I, IIr, and IIIr. In this and all 
other analyses to be presented, vari- 
ance was shown not to be hetero- 
geneous. 

The essential terms of analysis of 
variance for Exp. I, IIs, and IIIs are 
shown in Table 2. The rationale and 
proof of this technique have been pre- 
sented elsewhere (1). For evaluation 
of significance of similarity the proper 
error term is Ss/Similarity. The F 
of 5.29 is beyond the value (4.82) 
needed for significance at the 1% 
level. For evaluating the effects of 
other variables the appropriate error 
term is Pooled Ss X Practice/Similar- 
ity. Terms necessary for evaluating 
list differences within a given level of 
similarity are not included. It will 
be observed that intertrial rest is not 
a significant source of variance, F 
being less than 1. In Fig. 1 the curves 
for Exp. IIs and IIIs (medium and 
high stimulus similarity) have quite 
different shapes. This is reflected in 
the interaction term in Table 2 (Inter- 
trial Rest X Similarity) which is sig- 
nificant at about the 3% level of 
confidence. The rather bizarre shapes 
for these two curves argue against 
accepting this interaction as being 
psychologically meaningful. From 
the analysis, however, two conclusions 





BENTON J. UNDERWOOD 


are straightforward for these three 
experiments in which interlist stim- 
ulus similarity was manipulated: (a) 
as interlist stimulus similarity in- 
creases, rate of learning decreases, and 
(b) intertrial rest produces no differ- 
ences in learning for any level of 
similarity. 

Turning next to results produced by 
variation in response similarity (Exp. 
I, Ilr, and IIIr), we may first note 
that similarity is not an effective vari- 
able as far as trials to learn is con- 
cerned. The complete statistical anal- 
ysis will not be presented. It is 
sufficient to report that F for similar- 
ity is less than 1. Thus, variation in 
similarity which produces differences 
in rate of acquisition when it obtains 
among stimuli has little influence on 
learning when present among re- 
sponses. Although Fig. 1 might sug- 
gest that learning was more rapid 
under conditions with 30- and 60-sec. 
intertrial rests than for 4-sec. rest for 
Exp. I, IIr, and IIIr, the F for inter- 
trial rest (2.10) falls considerably short 
of the value (3.04) needed for the 5% 
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INTERTRIAL REST INTERVAL 


Fic. 1. Learning as a function of intertrial 
interval and intralist similarity. I, IIs, and 
IIIs indicate increasing stimulus similarity; I 
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IIr, and IIIr indicate increasing response 
similarity. 
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level. Again, then, we have no evi- 
dence to support Hovland’s (5) finding 
that distributed practice will facili- 
tate acquisition of paired nonsense 
lists. 

Finally, it must be pointed out that 
the lists in which response similarity 
was manipulated were learned more 
rapidly than those in which stimulus 
similarity was manipulated. Disre- 
garding intertrial rest, the mean num- 
ber of trials to learn the medium 
stimulus-similarity lists (Exp. IIs) 
was 25.50. When these lists- were 
“turned over” so that similarity was 
among responses the mean was 22.37 
(Exp. IIr). The corresponding val- 
ues for the high-similarity lists (Exp. 
IIIs and IIIr) were 20.71 and 24.40. 
F is highly significant (5.36). The 
analysis of errors during learning, to 
which we now turn, adds information 
which will be useful in interpreting 
these differences. 

Errors during learning —The mean 
number of overt errors per trial in 
learning is shown for each condition 


TABLE 2 


ANALYsIs OF VARIANCE FOR MEAN NuMBER 
or TrIAts TO LEARN IN 
Exp. I, IIs anv IIIs 











Source of Variation af pnd F* 
Similarity 2 |1586.0402 | 5.29 
Ss/Similarity 105 | 299.8429 
Practice 2 | 884.0216 | 14.84 
Practice X Similarity 4 3.0679 | — 
Intertrial rest 2] 21.5772 | — 
Intertrial rest 

X Similarity 4] 177.3040 | 2.98 
Lists/Similarity (pooled)| 6 |———** 


Pooled Ss XK Practice/ 
Similarity 198 


Total 323 


59.5600 














* With 2 and 100 (105) df, F at .05 level is 3.09, and 
at .01, 4.82. With 2 and 200 (198) df, F at .0S level is 
3.04, and at .01, 4.71. With 4 and 200 (198) df, F at .0S 
is 2.41, and at .01, 3.41. 

** This mean square has no meaning except when 
broken down to show differences in list difficulty within 
each similarity level. 
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Fic. 2. Overt errors per trial in learning as 
a function of intertrial interval and intralist 
similarity 


in Fig.2. The first fact demonstrated 
by Fig. 2 is that variation in response 
similarity (IIr and IIIr) resulted in 
greater error frequency than did vari- 
ation in stimulus similarity (IIs and 
IIIs). Thus, while learning was more 
rapid with variation in response simi- 
larity than with variation in stimulus 
similarity, more errors per trial were 
made in the former instance. 

A second fact is that similarity was 
a significant source of variance for 
Exp. I, IIr, and IIIr (F is 13.2) with 
error frequency varying directly with 
similarity. With variation in stim- 
ulus similarity F is 4.15, which falls 
between the 1% and 5% levels. Here 
there is no direct relation between 
similarity and error frequency since 
the number of errors for the high- 
similarity lists (IIIs) is slightly less 
than for the lists of medium similarity 
(IIs). 

A third fact to be noted in Fig. 2 is 
that for Exp. I, IIs, and IIIs, error 
frequency decreases with increasing 
length of intertrial rest. The F is 8.31 
which is well beyond the 1% confi- 
dence level. On the other hand, no 
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relation obtains between error fre- 
quency and intertrial rest for variation 
in response similarity. 

Recall.—All lists were recalled and 
relearned 24 hr. following original 
mastery. In analyzing the recall 
scores for Exp. I, IIs, and IIIs, only 
two significant sources of variance 
were found. One of these was stage 
of practice, with poorer recall asso- 
ciated with later stages of practice. 
This confirms previous findings with 
serial lists (11). The second signifi- 
cant source of variance was similarity, 
with recall better the higher the simi- 
larity. Intertrial interval during 
learning had no influence on retention 
as measured by recall. 

When Exp. I, IIr, and IIIr were 
subjected to analysis of variance, only 
stage of practice was found to be sig- 
nificant. Unlike variation in stimulus 
similarity, variation in response simi- 
larity produced no differences in 
recall. It will be remembered that 
variation in stimulus similarity pro- 
duced differences in rate of learning 
whereas response variation did not. 
It is possible, therefore, that differ- 
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Fic. 3. Recall as a function of stimulus sim- 
ilarity. The base line indicates the number of 
correct responses made for an item during orig- 
inal learning. 
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ences in recall as a function of stimulus 
similarity may be due to differences in 
number of reinforcements (correct 
responses) during original learning. 
To examine this possibility an item 
analysis of learning has been made in 
which number of reinforcements is 
held constant. When this is done, 
differences in recall as a function of 
stimulus similarity disappear. The 
results of this analysis for Exp. I, IIs, 
and IIIs are shown in Fig. 3. When 
items having the same number of rein- 
forcements are compared (ignoring 
intertrial interval), there are no appre- 
ciable differences in recall as a function 
of similarity. The raw recall scores 
also showed better retention for Exp. 
IIs and IIIs than for IIr and IIIr. 
Here again, however, when compared 
by item analysis so that frequency of 
reinforcement is equal, no differences 
of any consequence are apparent. It 
must be concluded that differences in 
similarity per se have little influence 
on retention as measured by recall. 
This confirms previous results with 
paired adjectives (8). 

It was pointed out earlier that more 
errors occurred in learning lists in 
which response similarity varied than 
in those in which stimulus similarity 
varied. It might be expected that 
such differences would occur in overt 
errors at recall and such is the case. 
For Exp. IIs and IIIs the total errors 
for all three conditions was 150 and 
104 respectively; for Exp. IIr and IIIr 
the corresponding values were 171 and 
186. 

Relearning.—Taken singly, none of 
the major variables influences rate of 
relearning. However, for both stim- 
ulus similarity and response similarity 
the interaction between stage of prac- 
tice and similarity is highly significant. 
The F for stimulus similarity is 6.02 
and for response similarity, 5.33, with 
an F of 4.71 needed for the 1% level. 
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With high similarity (either IIIs or 
IIIr), the mean number of trials to 
relearn decreases as a function of stage 
of practice, the values being 9.33, 
8.28, and 6.06 for Exp. IIIs and 8.02, 
7.36, and 6.89 for IIIr, for the three 
stages of practice in order. For low 
similarity (Exp. I), on the other hand, 
the mean number of trials to relearn 
increases as a function of stage of prac- 
tice, these means being 5.33, 6.08, and 
7.58. This interaction between simi- 
larity and stage of practice may be 
expected on the basis of differences in 
interlist similarity which resulted from 
variation in intralist similarity. In 
these experiments three experimental 
lists were used for each level of simi- 
larity. Intralist similarity was varied 
by manipulating the number of re- 
peated consonants. With low intra- 
list similarity consonants became 
repeated among lists as a result of 
avoiding duplication within lists. 
With high similarity repetition among 
consonants in different lists was 
avoided because of high repetition 
within lists. Therefore, we would 
expect considerable interlist interfer- 
ence for low similarity lists and amount 
of interference should be directly 
related to stage of practice, i.e., num- 
ber of previous lists learned. Thus, 
it would seem that with low-similarity 
lists, interference among lists more 
than counteracted the effects of prac- 
tice, while this was not true in the 
high-similarity lists. Other measures 
(e.g., recall scores) are consistent with 
this hypothesis but since this finding 
is secondary to the major purpose of 
the paper, such data will not be 
detailed here. 

Finally, it may be mentioned that 
the errors per trial during relearn- 
ing followed a pattern almost identical 
with that during original learning. 


Discussion 


In none of the five experiments has 
any facilitation resulted in learning by 
spaced practice as compared with 
massed. Furthermore, the results 
argue against a simple work-inhibition 
type theory. In Hovland’s (5) study 
in which facilitation by distributed 
practice did occur, the mean number 
of trials to learn was 16.4 for the 
massed condition and 13.5 for the 
spaced condition. In the present 
experiments the mean number of trials 
varied from 22 to 33. If work inhi- 
bition is involved, it would be sus- 
pected that greater inhibition would 
have occurred with massed practice in 
our experiments than in Hovland’s 
experiments. Now, it is true that 
Hovland’s distributed condition con- 
sisted of a 2-min. intertrial rest where- 
as our longest interval was 1 min. 
But, some consistent trend toward 
faster learning with distributed prac- 
tice should be present in our data even 
with l-min. rests if work inhibition 
develops. Such was not the case. 
Another possibility to account for the 
discrepancy may be contained in the 
fact that Hovland’s Ss were extremely 
well practiced. This is reflected in 
the mean values given above. In a 
previous study (9) we have shown that 
for serial learning differences between 
rate of learning under massed and dis- 
tributed practice do not change appre- 
ciably as a function of stage of practice 
but it is still remotely possible that 
stage of practice could be a variable 
for paired-associate learning. For the 
present the contradiction between 
Hovland’s two studies (4,5) and our 
results cannot be _ satisfactorily 
resolved. 


Confirming previous findings (8), the present 
results give no support to generalization as a 
critical process in learning paired-associate lists 
by massed and distributed practice. With vari- 
ation in stimulus similarity wide differences in 
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rate of learning occurred but no facilitation by 
spaced practice was observed for any level of 
similarity. There was, then, no interaction 
between similarity and intertrial rest. Similarly, 
in learning serial nonsense lists (11) interaction 
between these two variables has not been signifi- 
cant. However, interaction between intertrial 
rest and similarity did appear in an earlier study 
(12) using serial adjectives, with lists of high 
similarity showing more facilitation than lists of 
low similarity. This finding is being checked at 
the present time with a more extensive series of 
conditions. Except for this one case, variation 
in intratask similarity does not seem to reveal 
processes which are essential to the study of the 
influence of distributed practice. 

The data have shown that while variation in 
stimulus similarity among paired associates pro- 
duces significantly different rates of learning, 
comparable variation among response terms does 
not. That similarity among responses has an 
influence is shown by the direct increase in overt 
errors with increase in similarity. It seems 
likely that with increasing similarity among 
responses not only are interference tendencies 
increased but also factors making for facilitation 
in learning. It has been shown elsewhere (7) 
that in a transfer situation with variation in 
response similarity between lists, large amounts 
of positive transfer may occur even with a high 
frequency of errors from the first to the second 
list. In the present experiments with increase 
in response similarity the number of different 
letters to be given by S in spelling the syllables 
decreases. Thus, as similarity increases, S must 
remember fewer individual letters (which should 
facilitate acquisition) but must discriminate 
which particular combinations of these few 
letters must be attached to the different stimuli 
(which should retard learning). If these two 
processes are in rough balance, little change in 
rate of learning as a function of response similar- 
ity should be expected. On the other hand, with 
increase in stimulus similarity only the inhibiting 
or interfering factor increases. S has to learn to 
respond with dissimilar responses to increasingly 
similar stimuli, As similarity among these 
stimuli increases, discrimination among the 
stimuli to which the discrete responses are to be 
attached becomes more difficult. In this situ- 
ation, however, it is not clear why errors do not 
increase directly with increase in stimulus simi- 
larity. The fact that the same result was evi- 
dent in serial learning of nonsense syllables (11) 
shows that the law has some generality but no 
process has been suggested to account for it. 

The final result requiring comment is the 
decrease in errors as a function of intertrial rest; 
with variation in stimulus similarity overt errors 
decreased directly with length of intertrial rest. 


With variation in response similarity no such 
relationship was observed. It may also be 
reported that in a comparable series of experi- 
ments (manuscript in preparation) with paired 
adjectives, about the same results were obtained 
except that some decrease in errors (significant 
at 5% level) was present with variation in 
response similarity. Such findings might be 
interpreted as evidence for a differential-for- 
getting theory as proposed by McGeoch (6). 
This theory states that erroneous response ten- 
dencies, since they are weaker than correct 
response tendencies, will drop out or be forgotten 
more rapidly over a rest interval. If this is the 
case in the present experiments, we would expect 
faster learning by distributed practice (unless 
some other process counteracts). The theory 
has been offered to account for faster learning by 
distributed practice than by massed; here we 
find evidence for the theory in terms of errors, 
but there is no facilitation by distributed prac- 
tice. Error frequencies for the present experi- 
ments have been analyzed by Vincent-type 
curves as a function of stage of learning a given 
list. The massed and distributed conditions 
were plotted separately. This analysis showed 
no appreciable differences in the shapes of the 
curves as a function of intertrial interval although 
there was a trend for the differences in error 
frequency to be greater early in learning than 
late in learning. If it is reasonable to suppose 
that weak error tendencies are more frequent 
early in learning than late in learning, then these 
error curves could be said to support a differ- 
ential-forgetting theory. On the other hand, 
the fact that the error frequencies during relearn- 
ing by massed practice were less for lists learned 
by distribution than for lists learned by massing 
would argue against the differential-forgetting 
theory unless some permanent error-reduction 
process is postulated. It must be repeated again, 
however, that no differences in learning, recall, or 
relearning were found so that if the differential- 
forgetting theory is said to be supported by the 
error data, it must be quickly added that 
no noticeable effect on correct responses was 
observed at any time. In previous experiments 
with serial lists (10) and in the present experi- 
ments, error frequency seems to bear little rela- 
tionship to performance measured by correct 
responses. 

Another factor which might be relevant to 
error frequencies is the rest-interval activity. 
It was demonstrated earlier (10) that color 
naming during the distribution interval produces 
many more errors than does symbol cancellation 
(the activity used in the present experiments). 
Furthermore, the same differences were present 
during relearning by massed practice. These 
differences were interpreted to be the conse- 
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quence largely of a responding set induced by 
color naming although we were unable to rule 
out entirely an error-depressant effect of symbol 
cancellation. In symbol ca -ellation very few 
errors are made; instructions for accuracy are 
given each day S cancels. It is possible that S 
develops a tendency to make fewer errors in 
learning as a consequence of actually making 
very few errors in symbol cancellation. The 
present data are not well suited for an internal 
analysis by stage of practice to determine if such 
a set is developing. Such an analysis has been 
made and the results show trends which would 
support the set hypothesis but the differences do 
not attain statistical significance. Moreover, if 
such an error-reducing set is built up, it is not a 
general finding since it has not been present in 
serial learning (11) nor has it been evident in the 
present experiments in which response similarity 
was manipulated. For the present, the corre- 
lates of the phenomenon of fewer errors with 
longer intertrial rests must be left unspecified. 


SUMMARY 


Five experiments were performed to 
study the effect of (a) intertrial rest 
and (5) intralist stimulus and intralist 
response similarity on learning and 
retention of lists of paired nonsense 
syllables. The three intertrial inter- 
vals in each experiment were 4, 30, 
and 60 sec. In one experiment the 
lists had low stimulus and low response 
similarity; in two others the stimulus 
similarity was medium and _ high, 
respectively, with response similarity 
low; in the other two experiments the 
response similarity was medium and 
high, respectively, with stimulus simi- 
larity low. Similarity was manipu- 
lated by varying the frequency of 
repeated letters making up the syl- 
lables of a given list. All learning 
was carried to one perfect trial with 
retention of each list measured after 
24 hr. Each experiment involved 36 
Ss. The results show: 

1. There was no influence of inter- 
trial rest on rate of learning in any of 
the five experiments. Overt error 
frequency was inversely related to 
intertrial rest for experiments in which 
stimulus similarity was manipulated. 


2. Difficulty of learning increased 
with increase in stimulus similarity 
but remained unchanged for variation 
in response similarity. Overt errors 
increased directly with response simi- 
larity but not with stimulus similarity. 

3. No differences in retention oc- 
curred as a function of either intertrial 
rest or similarity. 

The results do not support Hov- 
land’s latest study in which learning 
of paired-nonsense lists was facilitated 
by distributed practice. The major 
differences in procedure are that 
Hovland used a 2-min. intertrial rest 
(compared to the maximum 1-min. 
rest in the present experiments) and 
his Ss were much better practiced 
than those used here. The present 
findings further support the conclusion 
that intratask similarity is not an 
important variable in the study of the 
influence of intertrial rests on learning. 
The fact that variation in stimulus 
similarity produced differences in rate 
of learning whereas response similarity 
did not may be explained on the basis 
of the different roles played by simi- 
larity in the two cases. The finding 
that fewer errors occurred with dis- 
tributed practice than with massed 
practice when stimulus similarity is 
varied is unaccounted for, but such 
differences in errors appear to have no 
relationship to rate of learning. 


(Received August 18, 1952) 
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STUDIES OF DISTRIBUTED PRACTICE: IX. LEARNING 
AND RETENTION OF PAIRED ADJECTIVES AS A 
FUNCTION OF INTRALIST SIMILARITY 


BENTON J. UNDERWOOD! 


Northwestern University * 


In the second study in this series (6) 
results were presented on learning and 
retention of paired-adjective lists with 
two levels of intralist similarity. Each 
high-similarity list consisted of five 
sets of four synonymous adjectives 
making ten stimuli and ten responses. 
Of the four synonyms in a set, two 
were used as stimuli and two were 
used as responses so that stimulus 
generalization, response generaliza- 
tion and stimulus-response generaliza- 
tion would occur. The results showed 
that intertrial rests up to 2 min. did 
not affect rate of learning for either 
the high- or low-similarity lists. Re- 
tention after 24 and 48 hr. was better 
following distributed practice than 
following massed only for lists of low 
similarity. Retention was not related 
to similarity when items were equated 
for number of correct responses during 
original learning. This latter finding 
was taken as tentative evidence 
against Gibson’s (2) theory which pre- 
dicts poorer retention of high-simi- 
larity lists than of low because of the 
greater recovery of generalization 
tendencies (interfering tendencies) in 
the high-similarity lists than in the 
low. Yet, since Gibson’s theory deals 
only with stimulus generalization, it 


1R. L. Morgan and E. J. Archer supervised 
the gathering of the data; Mr. Archer and Jack 
Richardson are largely responsible for the sta- 
tistical analysis. 

2 This work was done under Contract N7onr- 
45008, Project NR 154-057, between North- 
western University and The Office of Naval 
Research. 


was perhaps not a fair test of the 
theory to include response generaliza- 
tion and stimulus-response generaliza- 
tion in the lists of high similarity. 
One purpose of the present study is to 
make a specific test of the Gibson hy- 
pothesis by varying stimulus similarity 
and response similarity independently. 
The present study is very compa- 
rable to the eighth study (9) in this 
series except that in that study non- 
sense syllables were used instead of 
adjectives. Variation in similarity of 
syllables was accomplished by varying 
number of repeated letters among syl- 
lables. This is unsatisfactory as a 
test of the Gibson theory since there is 
no variation along a dimension of 
similarity as is the case in the present 
experiments with paired adjectives. 
Thus, the fact that we found no dif- 
ferences in retention with variation in 
similarity among nonsense syllables 
cannot be used as clear evidence 
against the Gibson hypothesis. 


PROCEDURE 


General.—There were six independent experi- 
ments of three conditions each, with 36 Ss 
assigned to each experiment. Within each ex- 
periment, three intertrial intervals (4, 30, and 60 
sec.) were used with similarity varied among 
experiments. In three experiments (Is, IIs, and 
IIIs), stimulus similarity was varied from low to 
high with response similarity constant at zero. 
In the other three experiments (Ir, IIr, and IIIr), 
response similarity was varied from low to high 
by simply “turning over” the'lists used for stim- 
ulus variation so that similarity among stimuli 
was constant at zero while response similarity 
varied from low to high. To summarize: 
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Stimulus Response 

Experiment Similarity Similarity 
Is Low Zero 
IIs Medium Zero 
IIIs High Zero 
Ir Zero Low 

IIr Zero Medium 
IlIr Zero High 


The zero similarity indicates only that no 
known systematic meaningful similarity existed 
among the items. 

Lists —All adjectives were taken from 
Haagen’s (3) list. In his study Haagen scaled 
400 pairs of adjectives for synonymity, vividness, 
and associative connection, and 480 words were 
rated singly for familiarity. In the present 
experiments synonymity was varied among 
stimuli (or among responses); hence, the associa- 
tive connection and vividness between stimuli 
and responses actually learned by S is unknown. 
Familiarity of single words was constant for all 
lists. Lists contained ten pairs. The ten words 
(stimuli or responses) of a given list among which 
similarity was varied were divided into three 
sets, one set containing four words, and two sets 
containing three words each. Similarity was 
varied by changing the degree of synonymity 
within each set. Haagen’s scale varies from .5 
(high synonymity) to 6.5 (low synonymity). The 
synonymity within sets for high similarity was .8 
to 1.6; for medium similarity, 2.1 to 2.7; and for 
low similarity, 3.1 to 6.0. An illustration of ten 
words used at each level of similarity follows: 


High Similarity: (angry, enraged, pained, wrath- 
ful) (royal, regal, kingly) (double, dual, two- 
fold) 

Medium Similarity: (anxious, distressed, troub- 
led, disturbed) (cautious, discreet, guarded) 
(liquid, flowing, solvent) 

Low Similarity: (fickle, heedless, fitful, giddy) 
(sickly, bedfast, feeble) (complete, perfect, 
utter) 


In constructing the lists, of course, similar 
items were not placed together in sets but were 
scattered throughout the list. In pairing words 
an attempt was made to eliminate all obvious 
associative cues of sound and meaning. Further- 
more, in making the three lists for a given experi- 
ment care was taken to keep interlist similarity 
at a minimum. 

The lists were presented at a 2:2-sec. rate 
(i.e., 2 sec. for the stimulus alone and 2 sec. for 
the stimulus and response together) by Hull-type 
drums, and three orders for each list were used 
to avoid serial learning. A single practice list, 
learned by all Ss, had medium stimulus similarity. 

Specific conditions. On the practice day S 
first learned the practice list to a criterion of six 
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correct responses on a single trial by massed 
practice (4 sec. between trials). Instructions 
were then given for symbol cancellation which 
was used as the rest-interval activity for the dis- 
tributed conditions (7). Following this, S con- 
tinued learning the practice list with 30 sec. 
between trials until one perfect recitation was 
achieved. After a 5-min. rest the list was 
recalled and relearned. 

For each experiment the three experimental 
conditions involved three different intertrial 
rests, namely, 4, 30, and 60 sec. Learning was 
always carried to one perfect trial. Recall and 
relearning by massed practice of each list took 
place 24 hr. following original learning. 

With 36 Ss, three lists, and three intertrial- 
rest conditions, counterbalancing of both lists 
and conditions was complete for each experi- 
ment. Except for differences in similarity, all 
six experiments were identical. 


REsULTS 


Practice list—The mean number of 
trials to learn the practice list by each 
group of Ss, and the mean number of 
errors per trial in learning the practice 
list are shown in Table 1. The F for 
both trials and errors is less than one. 
The product-moment correlation be- 
tween trials to learn the practice list 
and trials to learn all experimental 
lists combined for all experiments is 
.60 + .07; for errors the correlation is 
.71 + .07. The comparability of the 
six groups of Ss is high enough to 
allow direct comparison of perform- 
ance on experimental lists. 

Trials to learn experimental lists.— 
The mean number of trials to learn 


TABLE 1 


Mean TRIALS AND Errors Per TRIAL 
on Practice List* 











Experiment Trials Errors 
Is 20.00 1.56 
IIs 18.67 1.57 
IIIs 19.67 1.41 
Ir 20.44 1.30 
IIr 19.64 1.38 
Ilr 19.92 1.36 











* The om for trials based on within-groups variance 
is 1.53; for errors, .12. 
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with three intertrial intervals and 
three levels of stimulus and response 
similarity is given in Table 2.. Experi- 
ments Is, IIs, and IIIs have been 
analyzed separately in Table 3. In 
this, and all other analyses of variance, 
homogeneity of variance was ascer- 
tained before analyses were made. In 
Table 3 the appropriate error term for 
analyzing effect of similarity is Sub- 
jects/Similarity. All other effects re- 
quire the use of Pooled Ss X Practice/ 
Similarity. Portions of the analysis 
which evaluate differences in list diffi- 
culty (Lists in Table 3) within a given 
level of similarity are not presented. 
A complete discussion of this method 
of analysis has been given elsewhere 
(1). 

As shown in Table 3, similarity is a 
significant source of variance. Ob- 
serving the means of Table 2 it can be 
seen that this results largely from the 
fact that the lists of Exp. IIs took 
considerably longer to learn than did 
the lists of Exp. Is and IIIs. Tostate 
the empirical generalization: as inter- 
list similarity increases, trials to learn 
first increase and then decrease. In- 
tertrial interval is not a significant 
source of variance; as in the case with 
paired nonsense syllables (9), dis- 
tributed practice up to 60 sec. is shown 
to have no influence on rate of learning. 


- TABLE 2 


Mean Triats TO LEARN ExpERIMENTAL Lists 
UNDER ALL ConDITIONS 














Intertrial Interval 
Experiment 

4 sec. 30 sec. 60 sec. 
Is 14.67 16.92 14.36 
IIs 21.42 19.14 21.33 
IIIs 17.44 17.03 16.86 
Ir 19.08 15.25 17.78 
IIr 16.56 17.17 15.83 
IIIr 16.22 16.50 16.58 














*For Exp. Is, IIs, and IIIs, the om based on Ss 
X Practice/Similarity is .96; for Exp. Ir, Ir, and IIIr, 
the corresponding value is .91. 


TABLE 3 


ANALYsIs OF VARIANCE FOR TRIALS TO LEARN 
For Exp. Is, IIs, anp IIIs 











Source df =. F* 
Similarity 2 | 789.3704 | 5.41 
Ss/similarity 105 | 145.8586 


Intertrial interval 
Interval X similarity 


Practice 538.9537 | 16.61 
Practice X similarity 28.9074 

Lists 6 | ———_** 

Ss X practice/similarity | 198 | 32.4428 

Total 323 














* With 2 and 100 (105) df, F at .05 level is 3.09, at 
01, 3.41. With 2 and 200 (198) df, F at .0S is 3.04 and 
at .01, 4.71. With 4 and 200 (198) df, F at .05 is 2.41, 
and at .01, 3.41. 

** This mean square has no meaning except when 
broken down to show differences in list difficulty within 
each similarity level. 


An analysis for Exp. Ir, Ilr, and 
IIIr, comparable to that presented in 
Table 3 for stimulus variation, has 
revealed that neither intertrial inter- 
val nor similarity contributed signifi- 
cantly to the variance. As can be 
seen in Table 2, excepting Exp. IIs, 
all means fall fairly close to each other. 
The fact that variation in response 
similarity produces no appreciable 
difference in rate of learning confirms 
previous work with paired nonsense 
syllables (9). 

Errors during learning.—The mean 
number of overt errors per trial for 
variation in stimulus similarity are 
plotted in the left portion of Fig. 1, 
while corresponding values for re- 
sponse similarity are plotted in the 
right portion. Looking first at the 
curves for stimulus variation it can be 
seen that there is little relationship 
between error frequency and stimulus 
similarity. The F is far from signifi- 
cant. On the other hand, the errors 
clearly decrease as a function of inter- 
trial interval. The F is 6.87, with an 
F of 4.71 needed for the 1% confidence 
level. This confirms previous findings 
with paired nonsense lists (9). 
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Fic. 1. Mean number of errors per trial during learning. The curves to the left are for 
variation in stimulus similarity, those to the right, response similarity. 


Figure 1 shows an increase in error 
frequency associated with an increase 
in response similarity. The F is 3.57, 
which is significant between the .01 
and .05 levels of confidence. The 
same finding was also present in paired 
nonsense syllables (9). A trend to- 
ward decreasing error frequency with 
increasing intertrial interval can be 
observed in Fig. 1 for the experiments 
dealing with response similarity. The 








PER CENT CORRECT AT RECALL 











1-5 6-9 10-43 14-19 20Qven 
NUMBER OF REINFORCEMENTS 


Fic. 2. Recall as a function of stimulus simi- 
larity with number of reinforcements during 
learning held constant 


F reflecting this trend is 3.18, which 
has an associated probability of .05. 

Retention.—Recall and relearning 
took place after 24 hr. By either of 
these measures neither similarity nor 
intertrial interval proved to be a sig- 
nificant source of variance. This was 
true for the three experiments dealing 
with stimulus similarity and for those 
dealing with response similarity. In 
all six experiments recall following the 
60-sec. intertrial learning conditions 
was better than following the massed 
conditions (4-sec. interval) but these 
trends were not great enough to pro- 
duce statistical reliability. 

The recall results may be most 
clearly shown by item analyses in 
which items having comparable num- 
ber of reinforcements (correct re- 
sponses) during learning are grouped 
together, and percentage correct at 
recall for each grouping determined. 
Disregarding intertrial interval, this 
analysis has been made for Exp. Is, 
IIs, and IIIs, and presented in Fig. 2. 
No point is determined by less than 
73 items. It can be seen that there 
is some trend toward poorer recall 
with higher similarity than with low. 














DISTRIBUTION 


The medium-similarity list, which 
took longest to learn, shows the great- 
est loss, especially with the weaker 
items. This evidence, although quite 
meager, tends to support the Gibson 
theory. The over-all picture, how- 
ever, is that our technique for manipu- 
lating similarity has not produced 
large differences in learning and only 
insignificant differences in retention. 

Another item analysis has been 
made for recall as a function of inter- 
trial interval, disregarding similarity. 
All six experiments have been com- 
bined in obtaining the values for Fig. 3 
and no point is based on less than 165 
items. A trend in this graph which 
may be important is that with low 
levels of reinforcement recall is better 
following distributed practice than 
following massed. Maximum differ- 
ences (between 4-sec. and 60-sec. 
curves) are about 10%, which are 
highly significant statistically. Al- 
though we realize that testing extremes 
only is not statistically justified, the 
differences are here noted and further 
discussion of this matter will be given 
in the next section. 
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Discussion 


Little support has been found for 
the Gibson (2) hypothesis that the 
greater the stimulus generalization the 
more rapid the forgetting. However, 
it might be argued that since our 
method of manipulating similarity 
did not increase errors and trials in 
learning, the experiments are inade- 
quate tests of the Gibson hypothesis. 
Yet, if these experiments are consid- 
ered inadequate tests of the hypoth- 
esis, it will be extremely difficult to 
obtain satisfactory tests using mean- 
ingful verbal materials. It is our 
opinion, taking the evidence of the 
present experiments and previous ones 
(2, 8, 9), that the Gibson hypothesis 
is unsatisfactory; retention as meas- 
ured by recall at least is not related 
to stimulus similarity when degree of 
learning is constant. 


The present results show that variation in 
stimulus similarity produced significant differ- 
ences in learning but the relationship was not 
direct. Low- and high-similarity lists were 
learned at about the same rate and both faster 
than the medium-similarity lists. With paired 
nonsense lists (9) the relationship between trials 
to learn and stimulus similarity was direct. Be- 
fore accepting the results as indicative of intralist 
similarity, we should inquire as to whether or not 
some other dimension of the lists may have 
varied to produce these results. It is conceiv- 
able, for example, that in pairing items in con- 
structing the lists, associative connection for the 
different levels of similarity might have varied. 
While this did not seem likely, we had 15 ad- 
vanced undergraduate students scale the 90 pairs 
along a dimension of “how much the first word 
suggested or seemed to go with the second.” 
Using a five-point scale the results gave no evi- 
dence that the 30 pairs of words (3 lists of 10 pairs 
each) for a given level of similarity differed from 
either of the other two levels in associative 
connection. 

Quite indirect supporting evidence that the 
lists did not differ appreciably on dimensions 
other than similarity is found in the three experi- 
ments dealing with response similarity. Here it 
was found that as similarity increased, errors in- 
creased but rate of learning remained about 
constant. This result is exactly the same as 
found with paired nonsense syllables (9). For 
the present, therefore, the evidence gives no 
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reason not to conclude that the findings with 
manipulation in stimulus similarity are to be 
attributed to stimulus similarity. Accepting 
this, we still have no adequate explanation of 
why the lists of high stimulus similarity were 
learned more rapidly than the lists of medium 
stimulus similarity. 

As noted above, increase in response similarity 
produced no increase in trials to learn but did 
result in increased overt error frequency. Our 
explanation for this is essentially the same as 
that used in accounting for the same results in 
learning paired nonsense lists. Similarity among 
syllables is increased by increasing the number 
of repeated letters. Therefore, as similarity 
increases, the number of different letters to be 
remembered by S decreases, which should facili- 
tate acquisition. But, at the same time, the 
problem of discriminating which combination of 
letters is paired with each stimulus provides a 
task of increasing difficulty as similarity in- 
creases. Assuming that these two tendencies 
are in rough balance, we would not expect 
response similarity to effect rate of learning but 
errors should increase with similarity. In the 
case of meaningful similarity in the present 
experiments, it will be remembered that simi- 
larity was increased by increasing synonymity 
among three sets of adjectives within each list. 
With this increase in similarity the number of 
clear-cut ideas or concepts decreases. We are 
saying that with low similarity among words in 
a given set (e.g., complete, perfect, utter) there 
are three fairly distinct ideas or concepts to be 
learned, while with a high-similarity set (e.g., 
kingly, royal, regal) there is only a single idea or 
concept to be learned. Thus, as similarity in- 
creases, there are fewer distinct ideas to be 
remembered, but the problem of discriminating 
which representative or sample of the idea goes 
with a given stimulus becomes more difficult. 
Again, assuming that these positive and negative 
factors are in rough balance, the present results 
may be expected. 

We have found that during learning overt 
errors decrease directly with intertrial interval. 
This was also found with paired nonsense syl- 
lables (9). However, we have no added clues 
from the present results as to whether this repre- 
sents “differential forgetting” in the McGeoch 
(5) theoretical sense or whether it represents an 
error-depressant effect of rest-interval activity, 
or whether it indicates something else. We have 
no evidence that the phenomenon is related to 
speed of learning. 

Finally, certain relationships between recall 
results of the present experiments and those of 
previous experiments need to be pointed out. In 
the second study (6) in this series it was found 
that with paired-adjective lists of low similarity 
recall was significantly better following dis- 


tributed practice than following massed practice. 
With high-similarity lists (also in that second 
study) no such differences were apparent. In 
the present experiments no significant differences 
in recall following massed and distributed prac- 
tice were recorded. Thus, it would appear that 
for the low-similarity lists there is a contradiction 
between the two reports. However, in the 
earlier study, lists were learned to a criterion of 
eight out of ten items correct on a single trial 
before the retention interval, while in the present 
experiments the criterion was one perfect trial. 
This suggests the hypothesis that whether or not 
distribution of practice facilitates retention de- 
pends upon the level of learning attained before 
the retention interval is introduced. More spe- 
cifically, the over-all evidence (to be outlined 
shortly) gives support to two relationships: (a) 
when an item is reinforced many times, it will be 
better recalled if acquired under massed practice 
than if acquired under distributed practice; (b) 
when an item is reinforced a small number of 
times, it will be better recalled if acquired under 
distributed practice than under massed. 

For the present, the terms “few” and “many” 
reinforcements cannot be given specific quanti- 
tative meaning. And, it should be apparent 
that with an intermediate number of reinforce- 
ments we would expect neither massing nor dis- 
tribution to facilitate retention. The evidence 
in support of these above two principles may be 
summarized as follows: 


1. Hovland (4) found better retention follow- 
ing distributed learning of serial nonsense lists 
than following massed learning. We have found 
just the opposite (8). Hovland’s Ss were very 
well practiced, learning nearly twice as fast as 
the Ss we used. Fast learning means most items 
will be reinforced only a few times; slow learning 
indicates quite the opposite. Furthermore, we 
have shown that in the serial learning of nonsense 
syllables (8) massed practice facilitates the recall 
only of items reinforced many times. 

2. When paired-associate lists of adjectives 
were learned to eight out of ten correct (6) 
(averaging about ten trials), distributed practice 
facilitated retention. When lists required about 
16 trials to learn to the same criterion (because 
of high similarity), no differences were apparent 
on recall. In the present experiments where 
learning was carried to one perfect trial (requir- 
ing from 15 to 20 trials), no appreciable differ- 
ences in recall were observed. 

3. The item analysis of the recall of the pres- 
ent experiments (Fig. 3) shows that for items 
given few reinforcements, retention is better 
following distributed than following massed prac- 
tice. There is a suggestion in this figure that 
with many reinforcements, massed practice is 
followed by better recall than distributed. 
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In our series of studies we have no evidence 
which directly contradicts the two relationships 
suggested above. In one study (9) using paired 
nonsense lists the results neither affirm nor deny 
the two relationships. It is possible that item 
analyses are not satisfactory for testing such 
propositions since when, within a given list, some 
items are reinforced many times and others only 
a few, certain interactions may take place to 
minimize differences in recall. Because of this 
possibility, the only certain way of determining 
whether or not the two principles are sound is 
by manipulating degree of learning of list as a 
whole before the retention interval. 

If the suggested relationships can be cleanly 
verified in subsequent experiments, the explana- 
tory problem then becomes clear and theoretical 
explanations can be attempted. This has not 
been true as yet because of the apparent contra- 
dictions in the empirical findings. 


SUMMARY 


The variables of the present experi- 
ments in learning of paired-adjective 
lists were: (a) intertrial interval (4, 
30, and 60 sec.); (b) three degrees of 
stimulus similarity, and (c) three 
degrees of response similarity. Simi- 
larity was manipulated by varying the 
degree of synonymity among sets of 
words within alist. A separate experi- 
ment was performed for each level of 
similarity with intertrial rest varied 
within each experiment. Learning 
was always carried to one perfect trial 
with retention measured after 24 hr. 
The major results were as follows: 


1. Intertrial interval was not related 
to either learning or retention. 

2. Variation in response similarity 
did not influence rate of learning, but 
Overt errors increased directly with 
similarity. 

3. Variation in stimulus similarity 
had a complex effect on rate of learn- 
ing, the medium-similarity lists taking 
longest to learn. 


4. During learning, overt errors 
decreased directly with intertrial 
interval. 


5. Neither response similarity nor 
stimulus similarity produced signifi- 
cant effects on retention. 


No support was found for Gibson’s 
theory which predicts that retention 


will decrease as a function of stimulus 
similarity. An evaluation of several 
previous experiments in relation to 
the present ones led to the proposal of 
two relationships which may account 
for discrepancies among experiments 
and clarify direction of subsequent 
research. These are: (a) when an 
item is reinforced a small number of 
times, it will be better retained if 
acquired by distributed practice than 
if acquired by massed practice; (d) if 
an item is reinforced many times, it 
will be better retained following 
massed than following distributed 
practice. 


(Received August 18, 1952) 
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A CONFIRMATION OF WEBB’S DATA CONCERNING THE 
ACTION OF IRRELEVANT DRIVES! 


CARL M. BRANDAUER 


Columbia University 


Webb (4) has reported data indi- 
cating that the conditioned strength 
gained by an instrumental response, 
when conditioning is carried out under 
one drive, can be extinguished under 
another drive. In his experiment, a 
panel-pushing response by food-de- 
prived rats was conditioned with food 
reinforcement, and extinction was 
done with several subgroups under 
varying amounts of water depriva- 
tion. A positive curvilinear relation 
was obtained between the number of 
hours of water deprivation and the 
number of extinction responses emitted 
up to a given criterion, and Webb took 
this functional relationship as evidence 
of the transfer of response strength to 
an “irrelevant” drive. 

Two experiments performed re- 
cently in this laboratory, however, 
suggested that Webb’s experimental 
design had not included a possibly 
crucial control, namely, that of the 
effect of deprivation level upon pre- 
conditioning response strength or 
“operant level’ (3). In both experi- 
ments involving hours of food depri- 
vation and operant-level bar-pressing 
rate, a positive relation between these 
two variables was found.” While it is 
true that Webb took some measures 


1 The research presented here was part of a 
thesis submitted in partial fulfillment of the 
requirements for the degree of Master of Arts in 
the Faculty of Pure Science, Columbia Univer- 
sity. The author wishes to express his deep 
gratitude for the constant help and advice given 
him by Professor W. N. Schoenfeld. 

*C. P. Crocetti and R. B. Funston obtained 
these results separately in experiments con- 
ducted as part of unpublished theses in this 
Department. 


of the preconditioning strength of the 
panel-pushing response in his experi- 
ment, the determinations were made 
for a much shorter period of time than 
his extinction period, and then only 
at one drive level. Thus, the possi- 
bility exists that Webb’s conclusions 
regarding transferability, supported 
only by the positive relation he ob- 
tained between extinction responses 
and drive level, may be erroneous. 
However, Webb’s results, if confirm- 
able, would seem important to general 
behavior theory. The present note 
offers such a corroboration of the find- 
ing, which was obtained in an experi- 
ment primarily concerned with other 
aspects of the drive problem. 


APPARATUS AND PROCEDURE 


Only a brief outline of the apparatus and pro- 
cedure is given here, largely because the results 
were confirmatory in nature. The apparatus 
was of the bar-pressing type described in (2), 
with a bar j in. thick to minimize bar-biting. 
Recording was done by electrically operated 
counters, and the experiment was performed in a 
room where temperature and illumination could 
be controlled within close limits and where extra- 
neous noises were almost completely eliminated. 
The animals were individually housed in the 
combined living-experimental boxes described in 
(2). 

Data are presented from 32 male albino rats 
of Wistar stock, 90-110 days old at their arrival 
in the laboratory. Upon arrival, Ss were placed 
on a 22-hr. watering and feeding schedule for 
five days. The rhythm was 6o set up that the 
experimental hour fell at a time when Ss were 
22-hr. thirsty and zero hours hungry. On the 
sixth day, operant-level determinations were 
begun during the experimental hour indicated 
above and continued for seven days. The Ss 
were then split into three groups, matched for 
response frequency during the prior operant-level 
determination, and four days were allowed for a 
shift in the feeding and watering rhythm. The 
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TABLE 1 
Operant Levet anv Extinction Data Responses PER Hour 
Operant Level 
Drive Extinction First Day 
Group N — of Same Drive Different Drives 
Thirst 

Mean Median Mean Median Mean Median 
I 11 0 14.86 15.00 9.07 5.00 19.74 14.00 
II 10 11 15.50 12.75 10.35 9.88 21.70 22.50 
III ll 22 15.45 13.50 14.43 12.00 30.55 19.00 

All 32 15.26 13.25 



































shift was of such a nature as to provide a Group I 
(N = 11) which was neither hungry nor thirsty 
during the experimental hour; a Group II 
(N = 10), 11-hr. thirsty but not hungry; and a 
Group III (N = 11), which was 22-hr. thirsty 
but not hungry. Operant-level determinations 
were continued under these conditions for ten 
days. On the following day, all Ss were given 
water when about 1I-hr. hungry, the bars in- 
serted, and each response reinforced with a food 
pellet. This was continued until 40 reinforce- 
ments had been given. Two days were then 
used for the re-establishment of the feeding and 
watering schedule in force during the second set 
of operant-level determinations, and were fol- 
lowed by five days of extinction under the drive 
conditions outlined for this set of operant-level 
determinations. 


REsSULTs AND Discussion 


The mean number of responses 
emitted by all Ss during each of the 
last two sessions of operant-level 
determination under the same drive 
level, i.e., 22-hr. thirst and O-hr. hun- 
ger, was 15.26 (Group I, 14.86; Group 
II, 15.50; Group ITI, 15.45), and is in 
close agreement with results obtained 
in the same apparatus by Schoenfeld, 
Antonitis, and Bersh (3). As can be 
seen, the match of the three groups 
was very close, the difference between 
the extreme groups being .64 responses 
per hour. As summarized in Table 1, 
the mean number of responses emitted 
per hour in the last four sessions of 
operant level under different drive 


levels was 9.07, 10.35, and 14.43 for 
the 0-, 1l-, and 22-hr. deprivation 
groups, respectively. The difference 
between extreme groups, Groups I and 
III, was found to be significant at the 
.01 level of confidence, using Festin- 
gers nonparametric test (1). The 
other differences were not significant 
at the .05 level. 

The extinction data from the first 
extinction session are of primary inter- 
est here. As can be seen, the mean 
number of responses emitted during 
this hour by Groups I, II, and III 
was, respectively, 19.74, 21.70, 30.55. 
The difference between the number of 
responses emitted during operant level 
and extinction by the same group is 
significant at the .05 level of confi- 
dence for all groups. Hence it is con- 
cluded that, even with the present 
control, Webb’s finding is reliable. 


SUMMARY AND CONCLUSIONS 


An experiment designed to test 
Webb’s conclusions concerning the 
action of “irrelevant” drives was per- 
formed with the addition of an experi- 
mental control over the effect of drive 
level upon preconditioning response 
strength. Male albino rats in a bar- 
pressing apparatus were used, the 
Ss being divided into three groups 
matched on the basis of operant-level 
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responding under 22-hr. water depri- 
vation. Further operant-level deter- 
minations were made with the groups 
assigned to 0-, 11-, and 22-hr. water- 
deprivation levels. When these de- 
terminations were concluded, all Ss 
received 40 regular food reinforcements 
for the bar-pressing response while 11- 
hr. hungry and not thirsty. Extinc- 
tion was then carried out under the 


same conditions prevailing during 
the second set of operant-level 
determinations. 


The results obtained indicate that 
the strength gained by an operant 
response during conditioning under 
one drive can be elicited during extinc- 
tion under another drive. These data 
afford a confirmation of Webb’s find- 
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ings, even with the provision of the 
additional experimental control. 


(Received for early publication 
December 4, 1952) 
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RIGIDITY AS A FUNCTION OF ABSOLUTE AND 
RELATIONAL SHIFTS IN THE LEARNING OF 
SUCCESSIVE DISCRIMINATIONS! 


ARNOLD H. BUSS 
Carter Memorial Hospital, Indianapolis 


Rigidity is typically defined in terms 
of failure to shift from one set of 
responses (one mode of problem solv- 
ing, one basis for sorting) to another 
when the first is no longer reinforced. 
Several kinds of shifts may be distin- 
guished. A previous study (2) com- 
pared reversal shift (e.g., reversal of 
tall and short stimuli as the positive 
and negative cues in a height discrimi- 
nation) and nonreversal shift (e.g., a 
change from a form discrimination toa 
height discrimination). It was found 
that nonreversal shift resulted in more 
rigidity than reversal shift. 

In the present study absolute shift 
and relational shift were investigated. 
These shifts are similar to a transposi- 
tion shift in that Ss first learn to 
respond differentially to two stimuli 
and then are presented with other 
stimuli on the same continuum. In 
an absolute shift the same stimulus is 
negative in each of two successive dis- 
criminations. For example, Ss first 
learn a_ brightness discrimination 
(black positive, gray negative) and 
then are required to learn a different 
brightness discrimination (white posi- 
tive, gray negative). In a relational 
shift the positive and negative stimuli 
bear the same relationship to each 
other in both discriminations, but the 
positive stimulus of the first discrimi- 
nation becomes the negative stimulus 
of the second discrimination. For 
example, Ss first learn a brightness 
discrimination (gray positive, black 
negative) and then learn a similar dis- 


1 This experiment was carried out at the State 
University of Iowa. 


crimination (white positive, gray neg- 
ative); in both discriminations the 
positive stimulus is the brighter of the 
two. 

If Ss tend to respond relationally, 
absolute shift should result in slower 
learning of the second discrimination 
(more rigidity) than relational shift. 
If Ss tend to respond absolutely, rela- 
tional shift should result in more 
rigidity than absolute shift. 


MetTHOoD 


Subjects.—The Ss were 55 college students 
recruited from an introductory psychology class. 
They were assigned randomly to two groups, 
with the restriction that there be 20 acceptable 
Ss in each group. The criterion of acceptability 
was five consecutive correct responses in Series 1. 
If an S did not meet this criterion by the sixtieth 
trial, he was discarded. Nine Ss in Group I and 
six Ss in Group II were discarded, leaving 20 
acceptable Ss in each group. 

Materials.—The stimuli were wooden blocks 
of various forms, colors, areas, and heights. 
“Form” refers to the shape of the top and bot- 
tom surfaces of the blocks. There were three 
forms: circular, square, and triangular. There 
were five colors: red, yellow, blue, green, and 
gray. For the square and triangular blocks 
there were three top surface areas, which were: 
1.56 sq. in., .56 sq. in., and .25 sq. in. For the 
circular blocks there were four top surface areas: 
1.49 sq. in., .68 sq. in., .31 sq. in., and .11 sq. in. 
There were three heights: 2 in., 1 in., and .5 in. 
Every block was different from every other block 
with respect to form, color, area, or height. The 
various colors, forms, and areas were divided 
equally among the three different heights. 

Procedure.—The E read the following instruc- 
tions to S: “You will be shown a series of blocks, 
one at a time. Some of the blocks are Ve 
(V-E-C) and some are not Vec. You don’t 
know what a Vec is at the start. The idea is to 
find out what a Vec is. Decide as best you can. 
If you think a block is a Vec, say ‘Vee.’ If you 
think a block is not a Vec, say ‘not Vee.’ If you 
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are correct, I’ll say ‘Right’; otherwise I’ll say 
nothing.” 

Each stimulus was presented singly for ap- 
proximately 5 sec. The S responded vocally. 
The E said “Right” if S’s response was correct, 
but E said nothing if S’s response was incorrect. 
There was no time interval between series of 
stimuli, and S was not told that there was more 
than one series. 

Design.—There were two groups of Ss and 
two series of stimuli. In both series Ss were 
trained to respond differentially to the height of 
the stimuli. The groups differed only in their 
first series; for Group I 2-in. blocks were the 
positive stimuli, and l-in. blocks were the nega- 
tive stimuli; for Group II 1-in. blocks were 
positive, and 2-in. blocks were negative. All Ss 
were trained to a criterion of five consecutive 
correct responses. 

In Series 2 both groups were trained on the 
same height discrimination: 4-in. blocks positive 
and l-in. blocks negative. The series, which 
consisted of 32 blocks, was divided into halves. 
Each half consisted of eight 1-in. blocks and eight 
4-in. blocks. Within each half of Series 2 the 
stimuli were presented in random order; the 
order differed foreach S. Series 1 was presented 
in random order, too, and the order differed for 
each S. 

Only two responses were available to S: 
saying Vec or saying not Vec. The strength of 
the Vec response is inversely proportional to the 
strength of the not Vec response. Therefore, it 
is necessary to consider the strength of only one 
of these responses. The Vee response was arbi- 
trarily selected, and hereafter when a response 
is referred to without designation, it is the Vec 
response. Also the terms “positive stimuli” and 
“negative stimuli” are used with reference to the 
Vec response only. 


RESULTS 


In Series I both groups were trained 
to a criterion of five consecutive cor- 
rect trials. The mean number of 
trials to the criterion in Series 1 was 
16.60 (SD = 7.62) for Group I and 
18.75 (SD = 11.26) for Group II. 
The ¢ value of .70 indicates that the 
difference between groups is not statis- 
tically significant. 

In Series 2 both groups were trained 
on a height discrimination with }-in. 
blocks positive and 1-in. blocks nega- 
tive. The frequency data are sum- 
marized in Table 1. The series was 
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TABLE 1 


Mean Number or VEC Responses 1n Eacu 
Hatr or Series 2 1n RELATION TO 
tHE HEIGHT oF THE STIMULI 








One-half Inch One Inch 
Group * 
Mean SD Mean SD 














First Half 
I 4.55 | 2.01 1.00 | 1.34 | 5.92** 
II 4.55 | 2.16 | 4.55 1.28 | 0 





Second Half 





I 6.350 | 2.17 | 1.35 | 182. | 477° 
II 5.20 | 2.16 | 3.85 | 2.20 | 1.86 




















*The ¢ test for correlated means (9) was used. 
Although the distributions of Vec response frequencies 
tended to be skewed, the distributions of differences 
were either not skewed or only moderately skewed. 

** > <.001 for 19 df. 


divided into halves in order to demon- 
strate changes in response frequencies 
as the series progressed. In both 
halves Group I responded more often 
to 3-in. blocks than to 1-in. blocks, the 
differences being significant at the .001 
level of confidence in both instances. 
In the first half Group II responded as 
often to l-in. blocks as to }-in. blocks. 
In the second half there was a differ- 
ence in response frequencies but the 
difference is statistically insignificant. 
Inspection of Table 1 reveals that 
the response frequencies of Groups I 
and II to 3-in. blocks were equal in 
the first half and only slightly differ- 
ent in the second half of Series 2. The 
significance of the difference between 
the response frequency of each group 
to 3-in. blocks in the second half was 
tested by means of the Festinger tech- 
nique (3). The F ratio of 1.21 was 
not significant for 166 and 114 df. 
The differences between groups in 
their response frequencies to 1-in. 
blocks were fairly large. For the first- 
half frequencies the Festinger tech- 
nique yielded an F ratio of 4.55, signi- 
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ficant at the .001 level of confidence 
for 240 and 11 df. For the second- 
half frequencies the F ratio of 2.85 
was significant at the .01 level of 
confidence for 60 and 11 df. Thus, 
the failure of Group II to learn the 
height discrimination in Series 2 may 
be attributed to failure to inhibit the 
tendency to respond to 1-in. blocks. 


DIscussIoNn 


There are two basically different 
theoretical approaches which might 
be applied to the present experimental 
situation, the relational view (4,6,7) 
and the reinforcement-inhibition view 
(10,11). According to the former, Ss 
learn a relationship, not an absolute 
response tendency. The Ss in Group 
II learn to respond to the shorter of 
two stimuli but not to the taller. 
When the stimuli are moved down a 
notch on the height continuum (Series 
2), Group II should transpose and 
tend to respond to the smaller of two 
stimuli. This should facilitate their 
learning of the new discrimination. 
The Ss in Group I learn to respond to 
the taller of two stimuli. In Series 2 
they should transpose and tend to 
respond to the taller of the two stimuli. 
This should retard their learning. 

According to the reinforcement-inhi- 
bition view, Ss build up a strong 
response tendency toward stimuli of a 
given height and inhibit the response 
tendency toward stimuli of another 
height. Each time the response is 
reinforced in the presence of the posi- 
tive stimulus, there is an increment in 
the tendency to respond to that 
stimulus. Each time the response is 
nonreinforced in the presence of the 
negative stimulus, there is a decrement 
in response tendency. The discrimi- 
nation is learned by means of differ- 
ential reinforcement. 


The Ss in Group II first build up a 


response tendency toward I-in. blocks 
and inhibit the response tendency 
toward 2-in. blocks. In Series 2 they 
must extinguish the already strong 
response tendency to l-in. blocks and 
build up a response tendency to }-in. 
blocks. The strong response tendency 
to l-in. blocks acquired in Series 1 
should retard the learning of Group II 
in Series 2. 

On the other hand Group I first 
builds up a response tendency to 2-in. 
blocks and inhibits the response tend- 
ency to l-in. blocks. In Series 2 the 
tendency to respond to l-in. blocks 
has already been partially extin- 
guished and Group I has only to build 
up a response tendency toward }-in. 
blocks. This should facilitate the 
learning of Group I in Series. 2. 

The data clearly support the rein- 
forcement-inhibition view. Group I 
learned the discrimination in Series 2, 
and Group II did not. Furthermore, 
the only difference between groups 
was in their response frequencies to 
l-in. blocks (see Table 1). In-each 
half of Series 2 Group II responded to 
l-in. blocks significantly more often 
than Group I. This coincides with 
the prediction made by a reinforce- 
ment-inhibition view. 

It should be noted that the present 
experimental situation differs from 
typical transposition experiments (1, 
5,8) in that: (a) the stimuli are pre- 
sented one at a time, rather than in 
pairs; (b) the same stimulus is néver 
presented twice so that Ss must learn 
a response to a class of stimuli; and 
(c) Ss are not merely tested on the 
second set of stimuli but are required 
to learn a new discrimination. Any 
one of these three conditions might be 
enough to alter the predictions made. 
Therefore the present experiment can- 
not be considered as a test of the two 
theoretical positions. 








SUMMARY 


The present experiment compared 
the effects of absolute shift and rela- 
tional shift upon learning of a subse- 
quent discrimination. The stimuli 
were wooden blocks of various forms, 
colors, and sizes. There were two 
groups of human Ss and two series of 
stimuli. In Series 1 of Group I (abso- 
lute shift) 2-in. blocks were the posi- 
tive stimuli, and l-in. blocks were 
negative. In Series 1 of Group II 
(relational shift) l-in. blocks were the 
positive stimuli, and 2-in. blocks were 
negative. In Series 2 both groups 
learned the same discrimination: 4-in. 
blocks positive and l-in. blocks 
negative. 

It was found that Group I learned 
the discrimination in Series 2 while 
Group II did not learn. The inability 
of Group II to learn the discrimination 
in Series 2 was attributed to a failure 
to inhibit the tendency to respond to 
l-in. blocks. These results were in- 
terpreted as being in support of a 
reinforcement-inhibition view rather 
than a relational view of discrimina- 
tion learning. 


(Received August 11, 1952) 
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Yale University 


The present experiment was designed 
to study the effect of verbalization 
occurring at the time of observation of 
stimulus materials upon subsequent 
memory of the materials. An earlier 
investigation reported by Bartlett (1) 
suggests that verbalization may in- 
crease accuracy of retention. He 
dichotomized his Ss into two groups 
“‘vocalizers” and “‘visualizers’’—ac- 
cording to whether or not they overtly 
verbalized during observation, and his 
data suggest that “‘vocalizers”’ tend 
to recall details more accurately and to 
introduceless erroneous detail into their 
descriptions than do “‘visualizers.” 

A related phenomenon, the effect of 
labeling upon the reproduction of am- 
biguous figures, has been demonstrated 
in a well-known study by Carmichael, 
Hogan, and Walter (3). In that study 
reproductions made immediately after 
observation of a series of figures were 
shown to resemble the labels provided 
by E at the time of observation. 
While the authors interpreted this 
result as indicating that labeling influ- 
enced perception of the figures, it is 
equally plausible to assume that the 
effect was due to the influence of 
labeling upon retention. 

The present experiment was de- 
signed to test the prediction, based 
upon Bartlett’s findings, that labeling 
will improve the accuracy of retention. 


'This study was carried out as part of a co- 
ordinated series of studies on factors relating to 
communication and attitude change. The grant 
from the Rockefeller Foundation for this pro- 
gram is gratefully acknowledged. The writers 
wish also to thank Professors F. D. Sheffield and 
G. A. Kimble for their reading of the manuscript. 


Two groups of elementary-school chil- 
dren observed common objects and 
were tested a week later for retention. 
One group labeled the objects during 
observation, and the other did not. 
Neither group had any indication that 
they would be tested later for reten- 
tion. In contrast to Bartlett’s pro- 
cedure in which Ss were dichotomized 
after observation according to whether 
or not they spontaneously verbalized, 
in the present study verbalization was 
experimentally manipulated and Ss 
were assigned at random to the ver- 
balization or control groups. The 
latter procedure has the advantage of 
rendering the act of verbalization 
independent of any individual differ- 
ences correlated with the tendency to 
verbalize spontaneously. 

A secondary question to be investi- 
gated is the relationship between the 
effect of verbalization and the type of 
test employed in measuring retention. 
It was assumed that at least one effect 
of verbalization during observation 
would be to increase the strength of 
the same verbal responses in the test 
situation. With the further assump- 
tion that a recognition test in verbal 
form (with printed names), as com- 
pared with one in visual form (with 
photographs), would be relatively 
more sensitive to such a differential 
strength of verbal responses, it was 
predicted that the effect of verbaliza- 
tion would be more marked on a ver- 
bal form of the recognition test. 

A further evaluation of the effect of 
verbalization was provided by a 
written recall test administered to 
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separate subgroups after the same 
interval as the administration of the 
recognition tests 


PROCEDURE 


Seventy-two fifth-, sixth-, and seventh-grade 
school children were shown an array of 16 familiar_ 
objects. The verbalization group was given a 
sheet containing the names of the objects and 
was instructed to find, encircle, and pronounce 
the names as £ pointed one at a time to the 
actual objects. Under control (non-verbaliza- 
tion) conditions a photograph of the array of 
objects was substituted for the sheet of names, 
and Ss encircled the picture of each object indi- 

(cated by E£. In neither group were Ss led to 
expect later tests of retention. 

™ A week later, recall and recognition tests were 
administered. Both visual and verbal forms of 
the recognition test were employed: half the test 
given each S consisted of photographs of the 
test items and the remaining half of printed 
names. The visual and verbal halves were pre- 
sented in counterbalanced order. 

The particular observational procedures em 
ployed were designed with the purpose of keeping 
the two conditions as alike as possible in all 
respects except verbalization. Since it might be 
argued that any active response during observa- 
tion, whether or not verbal, might require 
greater attention to the stimulus and, hence, lead 
to superior retention, it was felt necessary to 
require an active, nonverbal response in the con 
trol situation. This consideration led to the 
selection of the procedure of encircling names or 
pictures of objects in the experimental and con 
trol group, respectively, on the assumption that 
the two methods require comparable degrees of 
attention on the part of Ss. 

While some implicit verbalization may have 
occurred spontaneously under control conditions, 
it is unlikely that it occurred with a frequency 
approaching the 100% value produced by the 
experimental conditions. To the extent that 
spontaneous verbalization did occur, it would 
only operate in the direction of reducing the 
experimental effects. 

Subjects —The Ss were pupils at the Worth- 
ington-Hooker Elementary School in New 
Haven.2 The Ss in the sixth and seventh grades 
were seen in observation sessions on one day and 

* The authors wish to express their thanks to 
Miss Helen Goodman, psychologist for the New 
Haven Board of Education, and to Mr. Francis 
Smith, principal of Worthington-Hooker Ele- 
mentary School, for their cooperation in making 
pupils available to serve as Ss. 
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those in the fifth grade on the following day. 
Within each grade Ss were randomly divided 
between the verbalization and control groups. 

Stimulus objects —One set of 16 objects was 
selected for original observation and a second set 
of 16 was selected as “confusion” objects to be 
included with the others on subsequent recogni- 
tion tests. The objects used included toys, mini 
ature animals, school supplies, common house 
hold and personal articles, a rose, and a pine 
cone. Considerations in choosing the items wer« 
that they should be visually distinct and recog- 
nizable, easily labeled without ambiguity, famil- 
iar to the Ss, and should cover a range from 
relatively low to relatively high interest. A 
photograph of the objects originally observed is 
presented in Fig. 1. Eight confusion objects, 
intermixed with eight original objects, are pic- 
tured in Fig. 2 as they appeared on the visual 
form of one half of the recognition test. 

Methods of observation.—The observation of 
the objects was conducted in individual sessions 
with S. The 16 objects to be observed were 
arranged in a four-by-four array on a mahogany 
desk top and were shielded from S’s view by a 
masonite screen until S was seated in a chair 
before them and the instructions were begun. 

In the experimental (verbalization) group two 
identical 83 X 11-in. mimeographed sheets bear- 
ing the names of all the objects in the array were 
presented to S. The names were typewritten in 
lower-case letters in a four-by-four array iden- 
tical to that of the corresponding objects on the 
desk. 

The screen shielding the objects from view 
was removed and S was instructed that he was 
to be given “a test of the way people see things” 
and that as £ pointed to each of the objects, S 
was to find its name on the first sheet, encircle it 
with a pencil, and pronounce it aloud. The S 
was further instructed of the identity of the 
arrangement of the names on the sheet and the 
objects on the desk. The E pointed to the ob- 
jects one at a time in a predetermined random 
order which differed for each S. After eight of 
the objects had been indicated by E, the first 
sheet was taken from S and he was instructed to 
continue on the second sheet in the same fashion. 
The £ continued by pointing to the remaining 
eight objects, following which the session was 
terminated. At no time was S given any indi- 
cation that he was to be later tested in any way. 

In the control (visualization) group the pro- 
cedure used was identical to that of the experi- 
mental group except that 44 X 10-in. halftone 
reproductions of a photograph of the array were 
substituted for the list of names and S was 
required to encircle the picture of each object as 
the actual object was indicated by FE. No men- 
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Fic. 1. 


Stimulus objects as viewed by Ss during the observation session. 


This photograph 


is a reproduction of the sheets on which Ss in the control group encircled pictures of objects during 


observation. 


tion of naming the objects was made in the 
instructions to this group. 

As nearly as possible, the picture of the orig 
inal array presented the objects from the same 
angle of regard and in the same spatial relation 
ships as they appeared when actually observed 
by the Ss. To increase the clarity of the picture 
the objects were photographed against a whit 
ground. 
photograph. 

Tests of retention—One week following the 
initial observation 24 Ss from the verbalization 
and 24 from the visualization group were given 
recognition tests, and the remaining Ss in each 


Figure 1 is a reproduction of this 


group were given written recall tests [he tests 
were administered, without prior announcement, 
to groups of Ss in a classroom setting. The 
recall and recognition tests were given in separate 
rooms. 

Construction of recognition test.—The complete 
recognition test, which included the 16 original 
objects plus 16 confusion objects, was presented 
in two halves, each including half of the original 
and half of the confusion objects. For each S 
one half of the test was presented in verbal form 
and the other in visual form. Halftone repro 
ductions of photographs and mimeographed lists 
of names were prepared for each of the two sets 





Fic. 2. 
of one of the actual test sheets used. 
fusion objects. 


Sample half of the recognition test in visual form. 


This photograph is a reproduction 


It shows eight original objects intermixed with eight con 
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of objects comprising the two halves of the test. 
These were similar to the sheets used during 
original observation in the control and experi- 
mental conditions, respectively. The position of 
each object in the recognition-test array differed 
from that in the original, but the same orienta- 
tion relative to the camera was maintained. On 
the printed sheets the name of each object ap- 
peared in the same relative position as did the 
picture of that object in the corresponding 
photograph. 

Individual test booklets were prepared by 
stapling together the appropriate test sheets. 
There were four different kinds of test booklets, 
distinguished by the arrangement of the two 
test sheets. If 4 and B represent the two halves 
of the test (consisting of different objects) and 
verbal and visual the form of the test, the four 
different arrangements of the two pages were as 
follows: (a) A-verbal, B-visual; (b) B-verbal, 
A-visual; (c) A-visual, B-verbal; and (d) B- 
visual, A-verbal. The four distinct test se- 
quences, when combined with the two methods 
of observation (verbalization and control), made 
eight different treatment combinations in each 
replication. Six replications were performed, 
two being made in each of the fifth, sixth, and 
seventh grades. 

Administration of recognition test.—The recog- 
nition tests were given a week after the observa- 
tion procedure in two classroom groups, one con- 
sisting of 16, and the other of 32, Ss. 

The test booklets were passed out to the Ss 
with instructions not to open them until told to 
doso. The group was told that the test included 
some names and some pictures of things and that 
they were to draw a circle around the names or 
pictures of those things that they had seen the 
previous week. They were instructed to make 
corrections of choices which they later decided 
were erroneous by marking the name or picture 
with an X, rather than by erasing the circle. 
The Ss were instructed that no one was to turn 
to the second page until everybody had finished 
the first and the group was given a signal by L. 
They were requested not to go back to the first 
page once they had proceeded to the second. 

Recall test—The remaining 24 Ss were first 
given a written recall test. This group was 
tested in a separate classroom at the same time 
as the others. Paper was passed out and the Ss 
were instructed to write down the names of as 
many of the objects they had seen as they could 
remember. When all Ss had finished, the papers 
were collected and this group was then given the 
recognition test. 


RESULTS 


For both recognition and recall tests 
separate analyses were made of correct 
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and incorrect responses. The analyses 
of variance of the correct and incorrect 
choices on the recognition tests are 
presented in Table 1. In this table 
comparisons between correlated and 
uncorrelated means are grouped sep- 
arately with their corresponding error 
terms (4, ch. 7). Table 2 gives the 
means of the recognition test results 
for the groupings of major interest— 
verbal vs. visual methods of observa- 
tion and verbal os. visual forms of the 
test. The mean number of correct 
and incorrect responses by the ver- 
balization and control groups on the 
recall test is presented in Table 3. 
Several aspects of the analyses of 
variance presented in Table 1 are of 
secondary interest and will be men- 
tioned briefly before discussing the 
principal results. Except for the dif- 
ference between the two forms of the 
test, the means for the comparisons 
of minor interest are not presented. 
Considering first the correct recogni- 
tions, it will be observed that there 
are significant differences between the 
two forms of the test and between the 
first and second halves of the test. 
These differences were in the direction 
of more items correctly recognized on 
the visual test and more on the first 
half of the test. There is also a sig- 
nificant interaction between test and 
position. This reflects the fact that 
the reduction in correct recognitions 
between the first and second half of 
the test occurs on the visual test only, 
i.e., that the score on the visual test is 
lower if it is preceded by the verbal 
test than if the two are presented in 
the reverse order. When the analysis 


is made in terms of number of items 
incorrect, neither the difference be- 
tween the two types of test nor the 
test-position interaction is significant, 
but there is a significant difference in 
number of errors on the two halves of 
This difference represents a 


the test. 


——— 
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VERBALIZATION 


reduction of errors on the second half 
of the test. 
Effects of verbalization during obser- 
vation 
Recognition test.—On the recogni- 
tion test there are significantly fewer 
errors made by the verbalization than 
by the control group (Tables 1 and 2). 
Similarly, the number of correct recog- 
nitions tends to be higher in the ver- 
balization group, although the F ratio 
of this difference falls slightly short of 
the .05 level of confidence. 
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For both correct and incorrect rec- 
ognition measures there is an inter- 
action of the type predicted, viz., 
that verbalization produces a larger 
effect on the verbal than on the visual 
recognition test. The F value of this 
interaction is significant at the .05 
level for both measures. As seen 
from Table 2, the nature of this inter- 
action differs somewhat for the two 
measures. The verbalization group 
made more correct recognitions on the 
verbal test, but slightly fewer on the 


TABLE | 


\NALYsIS OF VARIANCE OF Correct AND [NcorrecT Cuoices ON Recocnition Test 


1. Whole Plot 
\ original obseryation (verbalization 
vs. control) 
E school grade 
BC test X material 
BD test X position 
CD material X position 
ABC obs. X test X material 
\BD obs. X test X position 
\CD obs. X material X position 


Total treatment effects 


Error [replication error (including repli- 


cation interactions) + AE + BCF: 
+ BDE + CDE + ABCE + 
ABDE + ACDE] 


Il. Subplot 
B test (verbal os. visual form) 


C material (objects comprising two halves 


of test) 
D position (first vs. 
AB obs. X test 
\C obs. X material 
\D obs. X position 
BCD test X material X position 
\BCD obs. X test X material X position 


second half of test) 


Total treatment effects 
Error [replication error (including 


replication interactions) + BE + CE 


+ DE + ABE + ACE + ADE + 


BCDE + ABCDE] 


* Significant at .05 level of confidence. 
** Significant at .01 level of confidence. 


Correct Choices Incorrect Choices 
variance I variance I 
3.76 | 2.32 17.51 | 9.44** 
2 1.91 | 1.18 1.22 | .66 
l Ol | Ol St | 45 
] 10.01 | 6.17? Ol | Ol 
84 | .52 26 | .14 
l a 4. ae 1.26 | .68 
I Ol | Ol | 8 | 45 
I 2.34 | 144 | 51 28 
2.37 | 1.46 2.63 | 1.42 
38 1.62 1.85 
] 12.76 8.45 Q9 12 
l 84 6 2¢ 33 
] 7.60 5.03* 6.50 8.33* 
] 6.51 4.31* 4.59 5.88* 
l 09 06 84 1.08 
l 01 Ol 01 0] 
l 3.76 2.49 09 12 
l 6.51 | 4.31" 84 | 1.08 
8 4.76 3.15** 1.6¢ 2.12 
40 1.51 78 
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visual test. In terms of errors of 
recognition, the verbalization group 
showed a superior performance on 
both forms of the test, but the differ- 
ence on the verbal form was appreci- 
ably greater than that on the visual. 

Because the distribution of recog- 
nition scores, especially of errors, was 
skewed with the correct and incorrect 
choices bunched at the high and low 
ends of the scale, respectively, the F 
test was supplemented by Wilcexon’s 
rank difference test (5), which does 
not require the assumption of nor- 
mality. This test was applied to the 
differences between the verbalization 
and visualization groups on the verbal 
and visual tests separately and on the 
combined results of these two tests. 
The confidence levels given are for one 
tail. 

According to this analysis, the dif- 
ferences between the two groups on 
the combined tests were significant for 
measures of correct and incorrect 
recognition at less than the .05 and 
the .0O1 levels, respectively. On the 
verbal test both the larger number of 
correct choices and the lower number 
of errors by the verbalization group 
were significant at better than the .05 
level. On the visual test, the higher 
number of correct choices by the vis- 
ual group did not approach signifi- 


TABLE 2 
Recocnition Test Scores 
Mean NumBer or Correct AnD INCORRECT 
Cuoices BY VERBALIZATION AND CONTROL 
Groups ON VERBAL AND VISUAL 
Forms or Test. Eacu Mean 
Is Basep upon 24 Cases 




















Choices Form Verbalization| Control 
o of Test Group Group 
Correct verbal 6.38 5.46 
visual 6.58 6.71 
Incorrect verbal 0.55 1.83 
visual 0.92 1.33 














TABLE 3 


Mean Correct anno Incorrect Recatt Test 
RESPONSES BY VERBALIZATION 
anp Controt Groups 


| 
Incorrect 


Group N | Correct 
Responses 


| Responses 





TE ss cserves brvse site 


Verbalization 12 7.33 0.25 
Control 12 5.00 By, 
? <.01 <.01 


cance, but the reduced number of 
errors by the verbalization group 
yielded a p value of .06. When these 
data were pooled with the recognition 
data from the group first given the 
recall test, this difference is significant 
beyond the .05 level. The superior- 
ity of the verbalization group on this 
measure is of particular interest and 
will be discussed below. 

Recall test.—The results of the recall 
test further indicate the superiority 
of retention following verbalization. 
It will be seen from Table 3 that there 
is a higher number of objects cor- 
rectly recalled and a smaller number 
of errors by the verbalization than by 
the control group. Tested by Wil- 
coxon’s rank difference test, both of 
these differences are significant beyond 
the .O1 level (one-tail). 


Discussion 


The present data clearly support 
the hypothesis formulated at the out- 
set, that the accuracy of retention 
would be increased by verbalization 
at the time of stimulus observation. 
It remains to be discussed whether 


3 The results of the recall group on the recog- 
nition test were the same in all respects as those 
described for the group given only a recognition 
test. They were not included in the other com- 
parisons because they were not necessary to 
demonstrate the significance of the principal 
results, and it was felt that the recognition 
results might be regarded as more “pure” when 
not preceded by a recall test. 
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this finding is generally applicable 
beyond the particular techniques em- 
ployed in the present study. 


One hypothesis which might be advanced to 
explain the superior retention of the verbalization 
group would be that the Ss were unfamiliar with 
the names of the objects and that the verbaliza- 
tion procedure merely taught them the correct 
names. The resulting increase in proficiency at 
naming could be held to account for the superior 
performance of that group on the verbal tests 
(recall and verbal recognition). An attempt was 
made to reduce this possibility by choosing as 
stimuli common, everyday objects with which 
children might be expected to be familiar. That 
the Ss were familiar with the names of the 
objects is suggested by the results of the recall 
test, which were found to indicate that the dis- 
tributions of correct names used by the two 
groups were essentially the same. This would 
seem to imply that these names were equally well 
known by the two groups, and that the control 
group simply recalled them less frequently. 

A second hypothesis is that the superiority of 
the verbalization group is the result of the fact 
that the recognition test employed printed names 
similar to those used during the observation 
process, and that the verbal group was aided by 
recognition of the printed forms of the names as 
well as by memory of the objects themselves. 
However, similar reasoning would imply that 
the visual group would be better on the visual 
recognition test because the items used by them 
during the observation period were like those on 
the visual recognition test. While there was a 
significant interaction effect between the type of 
observation and the type of recognition test, the 
verbalization group still showed an over-all 
superiority on the combined tests. In fact, on 
the visual recognition test alone, there was either 
no difference (in terms of correct responses) or 
the verbal group was superior (in terms of incor- 
rect responses). 

It will thus be seen that, although the factors 
stated in these two hypotheses may have con- 
tributed to the results obtained, neither hypoth- 
esis, nor both in combination, can completely 
account for the present results. The effect which 
both fail to explain is the superiority in terms of 
decreased errors of the verbal group on the visual 
recognition test. Since this and the other find- 
ings concerning the effects of verbalization can 
all be explained in terms of our original hypoth- 
esis concerning the effect of verbalization upon 
retention, it seems justified to regard our results 
as lending confirmation to this hypothesis. 

At present, however, the mechanisms account- 
ing for the superior retention brought about by 
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verbalization are not clear. One possibility is 
that the procedure of verbalizing required the Ss 
in the experimental group to direct their atten- 
tion to more distinctive properties of the objects 
observed, while the control procedure may have 
allowed the Ss to direct their attention to less 
distinctive properties, sufficient to differentiate 
each of the original objects from the others in 
the observation array but not from the new 
objects introduced in the recognition test. Ex- 
amples of properties sufficient for the observa- 
tion task but not for the test performance are 
concrete properties, such as general shape, bright 
ness, and relative position in the array; or more 
abstract properties, such as belonging to the class 
of four-legged animals, vehicles, etc. 

An alternative to the above hypothesis is that 
the overt occurrence of a distinct response, such as 
pronouncing the names of the objects aloud, at 
the time of observation is necessary for the effects 
obtained. A differential test of these two hy- 
potheses would be provided by comparison of a 
group that merely circled printed names with 
one that both circled and pronounced them 
aloud. 

Superficially, the present results and those 
of a recent study by Belbin (2) may appear to 
lead to opposite conclusions. Belbin showed 
that interpolated verbal recall of stimulus ob- 
jects decreased the accuracy of subsequent 
recognition, while our results indicate that ver 
balization increases the accuracy. However, 
closer inspection of the problem suggests that 
the two sets of results are really complementary. 
It appears that the critical factor to consider is 
the accuracy of the verbalization which occurs. 
In the present study the accuracy of the labeling 
was guaranteed by providing the labels to the Ss, 
while in Belbin’s study the verbalization was 
made from memory without stimulus support 
and, hence, was subject to error. The relation- 
ship between errors of recall and subsequent 
failure to recognize the original picture is clearly 
shown in one of Belbin’s experiments by the fact 
that only in two cases in which no “importations”’ 
were made during recall was the original stimulus 
correctly identified. 

A second feature of Belbin’s study is probably 
responsible for the remarkably low frequency of 
recognition in his experimental (interpolated 
recall) group. This is the use of a single picture 
consisting of several related details as the stim- 
ulus and test object. An error in memory of a 
single detail might result in rejecting the test 
picture, even though many of the other details 
might be correctly remembered. If separate 
tests were made of the individual details remem- 
bered under Belbin’s conditions, the 
might turn out differently. 


results 
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SUMMARY 


Previously Bartlett (1) analyzed 
data on retention according to whether 
or not Ss had spontaneously verbal- 
ized during observation of stimulus 
materials, and found that verbalizers 
had superior retention. The present 
study was conducted to determine 
whether similar results would obtain 
when verbalization was introduced 
experimentally. 

Seventy-two elementary school chil- 
dren observed an array of 16 familiar 
objects. Half the Ss were given a 
sheet of names of the objects and the 
other half a sheet of pictures. As E 
indicated each object, Ss found and 
encircled on their sheet the name or 
picture of that object. In the group 
that had printed names, the name of 
each object was pronounced aloud as 
it was circled. No indication was 
given that there would be a later test 
for retention. 

After one week some Ss from each 
treatment were given a written recall 
test; others were given a recognition 
test made up of the 16 original and 
16 new objects. The latter test was 
given in two successive halves, one of 
which consisted of printed names of 
objects, and one of photographs. 

The principal findings were as 
follows: 


1. On the recognition test there 
was an over-all superiority of the ver- 
balization group, both in terms of 
more correct and of fewer incorrect 
responses. 

2. For both correct and incorrect 
response measures there was a signifi- 
cant interaction between the method 
of observation and the form (verbal 
or visual) of the recognition test. This 


was in the direction of a relatively 
greater superiority of the verbaliza- 
tion group on the verbal test. On the 
verbal test the verbalization group 
gave significantly more correct and 
less incorrect responses than the con- 
trol group. Even on the visual test 
the verbalization group made signifi- 
cantly fewer errors, although there 
was no difference between the two 
groups in number of correct responses 
on this test. 

3. Superior retention by the ver- 
balization group is also indicated by 
the results of the recall test. On this 
test the verbalization group made 
significantly more correct and fewer 
incorrect responses than the control 
group. 


The present results are compared 
with those of Belbin (2), who found 
a decrease in accuracy of retention as 
a function of interpolated recall, and 
it is suggested that the direction of the 
effects of verbalization depend upon 
the accuracy of the verbalization 
which occurs. 


(Received August 18, 1952) 
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THE DEVELOPMENT OF DIFFERENTIAL 
WORD RECOGNITION! 


DONALD G. FORGAYS 


Combat Crew Training Research Laboratory, Human Resources Research Center 


Experimentation designed to inves- 
tigate the accuracy of tachistoscopic 
recognition of words placed in the 
right and left peripheral fields of 
vision has been recently reported (3). 
Results indicate that Ss recognize sig- 
nificantly more words placed in certain 
parts of the right visual field than in 
corresponding parts of the left. Con- 
trol experiments ruled out explana- 
tions in terms of lateral cerebral 
dominance, selective attention, the 
anisotropy of visual space, and the 
disproportionate significance of parts 
of a word. The results were felt to 
be confirmatory of a selective retinal 
training and in conflict with the theory 
of a general equipotentiality in vision 
offered by Lashley (2). 

Hebb (1) has proposed that separ- 
ate parts of a receptor surface are indi- 
vidually trained. In reading English 
text, then, the next word to be recog- 
nized is always to the right of the 
word an individual is concerned with 
in central vision. The results of the 
experimentation referred to above 
reinforce the point that this situation 
appears to constitute a selective train- 
ing condition. If this is true, one 
would expect that a developmental 
study would provide further evidence 
on the effects of differential training. 
Specifically, one would expect that in 
the early educational grade levels 
there should be no gross differences in 
differential word recognition of words 
presented to the right and left of cen- 


1This research was carried out at Western 
Michigan College. Mr. James Boomgard and 
Miss Arlene Damerau aided in the collection of 
data. 


tral fixation. The slow assemblage of 
a specific receptor matrix should be 
reflected by the gradual dispersion of 
curves of recognition of words pre- 
sented to the right and to the left of 
central fixation plotted against edu- 
cational grade level. 


METHOD 


Twenty three- or four-letter English words 
were presented tachistoscopically to 144 stud- 
ents. Each word was } in. high and from } to 1 
in. long, depending upon the number of letters. 
Each word was printed in upper-case lettering 
on a white card which was used for presenting 
the word in both the right and the left visual 
fields. The characteristics of the rotating tachis- 
toscope used and the illumination have been 
described elsewhere (3). Briefly, here, the illum- 
ination was equal for the lateral halves of the 
field and the exposure time of the presentation 
was approximately .15 sec. 

The Ss used binocular vision and were seated 
about 24 in. in front of the stimuli; a fixed chin 
rest controlled this distance. 

Cross-hairs in front of the center of the expo- 
sure field were used as the fixation point. The 
stimulus words appeared in random order to the 
right or to the left of this point, the center of 
each word being 1} in. from the point. 

The words were common English words 
selected from an elementary reader; ten were 
three letters and ten were four letters in length. 
Each educational grade-level group consisted of 
12 Ss, six male and six female. The educational 
levels sampled were school grades 2 through 10 
and each of the first three college years. The Ss, 
words, and direction of words from the fixation 
point were arranged in a 2 X 2 latin square for 
each school grade-level group of 12 Ss. Fre- 
quency of recognition for any one group could 
then be determined for each of the two visual 
fields apart from particular stimulus words or 
particular Ss. 

Each S was acquainted with the apparatus 
and the experiment and given a short practice 
period before the test words were presented. 
During the exposure of the test words, two Os 
watched the Ss carefully for evidence of eye 
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movement. If such movement occurred at the 
time of exposure, the response was not recorded 
and another word was substituted for the one 


discarded. 
RESULTS 


Only a completely correct response 
was scored. Since no significant dif- 
ferences were found between the rec- 
ognition scores for three- and four- 
letter words or between the scores 
obtained by the male and female Ss 
of any one grade-level group, they 
were combined. 
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Recognition scores for total recog- 
nition as well as recognition to the 
right or to the left of fixation are pre- 
sented in Fig. 1 for each grade-level 
group. It will be noted that, as one 
would expect, the total recognition 
score increases gradually through the 
educational levels. The recognition 


scores for words to the right or left of 
fixation overlap considerably for Grade 
Levels 2 to 7 and diverge through the 
remaining groups. 
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Recognition scores for Ss representing 12 educational grade levels. 
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number of words recognized; WR = the number of words recognized when exposed to the right of 
fixation; WL = the number of words recognized when exposed to the left of fixation. 
each educational grade level. 
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To examine the significance of in- 
crease in total recognition with ad- 
vance in educational level as well as 
the difference in recognition of words 
presented to the right and to the left 
of central fixation, an analysis of 
variance was undertaken. The anal- 
ysis was based upon two measurements 
(right and left recognition) on several 
independent groups (the educational 
grade levels). The results of these 
calculations are presented in Table 1. 

To test whether the total recogni- 
tion scores for each of the 12 groups 
were drawn from a common normal 
population, the mean square between 
the educational grade levels was com- 
pared with that between Ss in the 
same group. For 11 and 132 df, an F 
of 16.7 is significant at the .001 level 
of confidence. On the basis of this 
F ratio and the results depicted in 
Fig. 1, we may conclude that total 
recognition scores are significantly 
higher for advanced educational groups 
than those for early educational 
groups. 

As a test of the significance of the 
difference between the extent of recog- 
nition of words presented to the right 
in contrast to those presented to the 
left of fixation, the mean square be- 
tween right and left recognition scores 
was compared with the mean square 
for the interaction of Ss and right and 
left recognition scores. For 1 and 132 
df, an F of 75.01 is significant at the 
.001 level of confidence. Words are 
recognized significantly more often 
when they are exposed to the right of 
fixation than when they are exposed 
to the left. 

The final test extracted from this 
analysis concerns the extent of differ- 
ential recognition as it relates to the 
educational grade level. The mean 
square for the interaction of right and 
left recognition scores and educational 
grade level was compared with the 


TABLE 1 


ANALYsIs OF VARIANCE oF RIGHT AND LEFT 
Worp Recocnition Scores or TWELVE 
EpucaTionaL GrapE-Levet Groups 








Source df | Squares F 





Between educational grade- 
level groups (total rec.) 11 | 536.37) 16.70 
Between Ss in same group | 132 | 384.79 


Total between Ss 143 | 921.16 


Between right and left 
recognition scores 1} 157.53) 75.01 

Interaction: right and left 
recognition X educa- 


tional grade level 11 | 169.84) 7.35 
Interaction: Ss X right 
and left recognition 132 | 277.13 
Total within Ss 144 | 604.50 
Total 287 | 1525.66) 














mean square for the interaction of Ss 
and right and left recognition scores. 
For 11 and 132 df, an F of 7.35 is 
significant at the .001-level of confi- 
dence. Reference to Fig. 1 and these 
results lead to the conclusion that the 
superiority of recognition of words 
presented to the right of fixation over 
the left is significantly related to edu- 
cational grade level. That is, there 
is greater recognition of words exposed 
to the right of fixation over the left for 
the higher educational groups. 


Discussion 


This experimentation has confirmed 
earlier research in showing that in late 
adolescence and adulthood the accur- 
acy of word perception is related to 
the field of vision in which words are 
exposed. In addition, it has shown 
that this differential accuracy is not 
present during the period roughly cor- 
responding to the educational grade 
years two through seven. 

These results support the hypoth- 
esis of a selective retinal training coin- 
cident with the educational process 
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through the conditions present in the 
reading of English text. It was sug- 
gested in an earlier paper (3) that a 
factor in this training process may be 
the neural equivalent of a selective 
visual attention during early years 
which is inoperative when learning is 
complete. Evidence concerning this 
suggestion may be obtained by a re- 
examination of Fig. 1. Itcan be seen, 
there, that the absolute number of 
words recognized when exposed to the 
left decreases at the educational Grade 
Levels 5 through 8 from what it is 
during the fourth and fifth grades. 
This may well be indicative of a selec- 
tive attention operating during these 
years but which is inoperative later as 
the earlier experiments (3) clearly 
demonstrate. 

In addition to supporting the hy- 


pothesis of a selective retinal training,’ 


these results are, as were those of the 
earlier research, inconsistent with the 
theory of a general equipotentiality in 
vision. 

SUMMARY 


1. The relationship between educa- 
tional grade level and recognition of 
words tachistoscopically exposed to 
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the left and right peripheral fields of 
vision was investigated. 

2. Total recognition of words was 
found to increase significantly with 
increase in educational experience. 

3. Recognition of words exposed to 
the right was found to be significantly 
greater than for words presented to 
the left of fixation. 

4. Superiority of recognition of 
words presented to the right of fixa- 
tion over the left is significantly 
related to educational grade level. 

5. These results support the posi- 
tion that separate parts of a receptor 
surface may come, during a process of 
training, to evoke different percent- 
ages of correct response. In support 
of earlier experimentation (3), these 
results are inconsistent with a theory 
of a general equipotentiality in vision. 


(Received August 19, 1952) 
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FOREKNOWLEDGE AS A FACTOR AFFECTING PERCEP- 
TUAL DEFENSE AND ALERTNESS! 


O. W. LACY, NATALIE LEWINGER, AND JOHN F. ADAMSON 


University of Pennsyloania 


During recent years a considerable 
literature has been produced concern- 
ing the part such personal factors as 
values, needs, rewards, and the like 
play in the process of perception. 
McGinnies (11,12) has reported data 
in which significantly higher recogni- 
tion thresholds are obtained for emo- 
tional over nonemotional words; these 
data are interpreted as being the result 
of the arousal of perceptual defenses 
against the recognition of the anxiety- 
producing, unpleasant stimulus words. 
Other investigators have reported that 
emotion-inducing stimuli may result 
in lowered recognition thresholds (1, 
8,10). 

To our knowledge no one has raised 
the question of whether there are dif- 
fering conditions under which the 
same emotion-arousing words will be 
differentially perceived. Will Ss, for 
example, see taboo words less easily 
than nontaboo words under one set of 
instructions and more easily with a 
different orientation? This question 
has interest since such an interaction, 
were it to be demonstrated, could not 
readily be explained on the grounds 
that taboo words occur less frequently 
in general reading, an hypothesis 
which has been offered by various 
critics (3,5,6,13,14). 

The primary purpose of this experi- 
ment was to test the hypothesis that 
whether or not S is slow or fast in 
reporting words depends on the nature 


! The authors wish to acknowledge their deep 
appreciation to Prof. Malcolm G. Preston whose 
stimulating instruction, kindly encouragement, 
and rigorous criticisms made the investigation 
possible. He is, of course, in nowise responsible 
for any errors we may have made. 


of his expectations at the time the 
word is presented to him and that this 
dependence is more marked in the case 
of taboo than nontaboo words. 


MetTuHop 


A pparatus.—The apparatus used for the pre- 
sentation of the stimuli to Ss was a Jenkins modi- 
fication of the Dodge tachistoscope, operated by 
an electrically driven timer. A _ calibration 
study* by two of the authors (NL and JFA) using 
a photokymographic technique established a 
linear relationship between length of exposure 
and amount of resistance coupled to the timer 
control; i.e., as resistance increased in steps of 
10 kilo-ohms, exposure time increased 2.96 
msec. Most of the computations, consequently, 
were based on the more easily handled resistance 
units. 


Design: Experiment I 

Effect of foreknowledge-——A 2 X 3 factorial 
design was employed using two “emotion” con- 
ditions (taboo and nontaboo) and three “fore- 
knowledge” conditions (None, General, and 
Specific). Since knowledge or expectation was 
to be an independent variable, the effect of which 
was to be determined solely by varying the 
experimental conditions, it was imperative that 
the thresholds be obtained from the first word a 
given S might see. To use mean recognition 
thresholds from a group of words given to one or 
more Ss would be to confound the foreknowledge 
condition: for by such a series S’s expectations 
would be surely altered in the direction of cer- 
tainty. In order to be able to generalize from 
our results, it was therefore required that several 
Ss under each condition be used, with each 
tested on a single stimulus. 

Thirty different Ss were randomly assigned, 
five each to the six experimental conditions. 
Each S was tested on a different word, so that the 
error term would include variability attributable 
both to S differences and to word differences. 
The 15 different taboo words, one for each of the 


2? We wish to acknowledge our indebtedness 
to Prof. William A. Shaw and to Mr. George 
Croswell in connection with this aspect of the 
study. 
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15 Ss assigned to the three foreknowledge varia- 
tions of the taboo condition, were randomly 
assigned under the restriction that the sexual, 
scatological, and ambiguous words should be 
apportioned as equally as possible among the 
three combinations of foreknowledge and taboo. 
The 15 nontaboo words were in the same way 
randomly assigned one each to the 15 Ss assigned 
to the three nontaboo and foreknowledge 
combinations. 

In order to meet objections which might be 
raised as to the essential quality of McGinnies’ 
emotionally charged words, the 15 taboo stimuli 
chosen were readily recognizable Anglo-Saxon 
obscene words. Of these were eight words with 
scatological and seven with sexual denotations. 
Every effort was made to choose words with wide 
geographic circulation. To our knowledge no 
word carried taboo meaning only in the argot of 
the perversions; nor were any taboo only in a 
colloquial or provincial sense. To complete our 
list, however, it was necessary to include three 
words of possibly ambiguous nature—words 
which are known to have quite socially accepta- 
ble, nontaboo meanings. In the list of taboo 
words, 13 were of four letters, 1 of three letters, 
and 1 of five letters. The latter two were 
among those of known ambiguity. 

The nontaboo list was drawn from Thorn- 
dike’s Teacher’s Word Book of 20,000 Words (15), 
under the following conditions: (a) a given word 
must come from the twentieth thousand in fre- 
quency—by this choice we hoped to reduce the 
possibility of obtaining differences which could 
be accounted for on the grounds that the non- 
taboo words were more frequently encountered 
in visual experience; (b) it must not begin with 
the letter w, x, y, or z; (c) it must be no longer 
than the matched taboo word; and (d) it must 
match, as nearly as possible, the structural char- 
acteristics of some one of the taboo words chosen. 
(The effort to obtain structural matches was 
undertaken with the purpose of maximizing the 
opportunity for confusion with the taboo words 
when the nontaboo word was misperceived. In 
this way we hoped to secure prerecognition 
guesses for an analysis comparable to McGinnies’ 
content analysis.) This was the nontaboo list: 


hunk ding teal swat  scow 
gens leal cusp dias iota 
shoo punt puck tale puffy 


The three degrees of foreknowledge may 
briefly be described as follows: (a2) No Knowledge 
upon the basis of which S could anticipate the 
word; (b) General Knowledge wherein S was 
told that he would see words which are com- 
monly thought to be obscene and taboo or that 
he would see words not often encountered in 
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general reading; and (c) Specific Knowledge, de- 
fined by £’s actual enunciation of the words S 
would see, though the words were never pre- 
sented orally in the same order that later 
obtained for the tachistoscopic presentation. 

Effect of habituation—Using the same six 
experimental conditions, the same taboo and 
nontaboo words were randomly assigned four 
more times to the 30 Ss. These assignments 
were made in an effort to study possible habitu- 
ation effects, and were subject to the restrictions 
governing the first assignment. No word, 
furthermore, was allowed to appear twice in the 
list for a given S, nor could a given word occur 
twice in the same order. It should be noted 
here that, by giving Ss four additional words of 
similar character under the same instruction, 
the effect of foreknowledge will be confounded. 
But this confounding will be of no grave concern 
if, in the analysis of Ss’ reactions to the first 
word seen, foreknowledge can be shown to have 
a significant effect. 


Design: Experiment II 


After completion of the original experiment, 
a replication was conducted using exactly the 
same words and conditions. Thirty new Ss 
were randomly assigned to the six experimental 
combinations, five to each cell, and the five 
stimulus word assignments were newly random- 
ized under the limits previously set forth. 


Subjects and Procedure 


The 60 Ss, all male graduate students in the 
social sciences and all naive with respect to the 
experiment, reported individually to the experi- 
mental room. The E (O.W.L. in each case) 
explained the nature of the apparatus and re- 
hearsed the presentation procedure by using two 
practice words (east and word), neither of which 
was of course used in the experimental trials. 
The S was instructed to guess if he liked, but to 
be sure to say the word just as soon as it was 
recognized. 

For the No Knowledge Ss, E then read, “You 
will see five words, each of which will be pre- 
sented until it is recognized and reported before 
going on to the next.” For the General Knowl- 
edge Ss in taboo cells, E read, “You will be pre- 
sented five words which are ones commonly 
thought to be obscene and taboo, each of 
which .. .” etc. For the nontaboo General 
Knowledge Ss, “You will be presented five words 
which are not often encountered in general read- 
ing, each of which . . .” etc. For the Specific 
Knowledge Ss, E read, “You will be presented 
five words like . ? \ 

, and , each of which...” 








etc. 
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In the tachistoscopic presentation series for 
each word, E began at the lowest setting avail- 
able—well below the visual threshold—and pro- 
ceeded upward in equal steps of 2.96 msec. until 
S gave a certain, and correct, response. All 
guesses were carefully recorded. When S gave 
an incorrect guess, E so indicated in a noncom- 
mittal way; when S gave a correct response which 
was obviously a guess, E asked noncommittally, 
“Are you certain?” and proceeded with the series 
until the response was certain. In all cases the 
certain reply was taken as the threshold. Fol- 
lowing the completion of the last trial, each S 
was sworn to secrecy until the experiment might 
be completed and was given full knowledge of the 
design and procedures and an explanation of the 
hypotheses under examination. 


RESULTS 
Effect of foreknowledge.—Figure 1 


presents the mean recognition thresh- 
olds for the first words seen under the 
two types of stimulus words and the 
three conditions of knowledge for the 
initial group of 30 Ss in Exp. I. With 
respect to the knowledge condition it 
shows that as S’s degree of knowledge 
about the stimulus increases, his rec- 
ognition threshold is in general low- 
ered. Whatever the effects may be 
relative to emotional or nonemotional 








stimuli, it is clear that in general an 
inverse relationship obtains between 
recognition thresholds and stimulus 
knowledge. 

Of more importance to the hypoth- 
esis under consideration is the char- 
acter of the interaction between the 
taboo and knowledge conditions. The 
sharp drop of the taboo mean recog- 
nition thresholds below the nontaboo 
thresholds under conditions of General 
and Specific Knowledge, as shown in 
Fig. 1, demonstrates clearly that the 
effect of foreknowledge serves to 
reduce recognition thresholds more 
rapidly and more markedly in the 
case of taboo stimuli. 

Since our design does not duplicate 
McGinnies’, to find support for his 
work we may cite as the best available 
comparison, that between the non- 
taboo and taboo cells under No Knowl- 
edge. It is evident by inspection of 
Fig. 1 that a difference exists; but in 
the initial data this difference is just 
short of significance at the .02 level of 
confidence. 
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Fic. 1. The recognition thresholds for 30 Ss in 
Exp. I to the first word seen 


Fic. 2. The recognition thresholds for 30 Ss in 
Exp. II to the first word seen 








Figure 2 illustrates the striking sim- 
ilarity of results obtained with the 
replication in Exp. II; the same drop 
of the taboo mean recognition thresh- 
olds below the nontaboo thresholds 
obtains under the General and Specific 
Knowledge conditions. 

Although inspection of the two 
figures seems to indicate discrepancies 
in certain of the cell means between 
the two sets of data, e.g., especially in 
the nontaboo No and General Knowl- 
edge cells, ¢ tests in the critical com- 
parisons indicate that these differences 
may in all cases be attributed to 
chance (p’s are all below the .10 level 
of confidence). 

Consideration of the insignificant t 
tests between the discrepant cell 
means and of the similarity of the 
separate analyses of variance for the 
two samples led us to pool the data. 
Since Bartlett’s test of homogeneity 
of variance (4, p. 196) had indicated 
significant heterogeneity in the orig- 
inal and replication data, the analysis 
of variance summarized in Table 1 
was performed on data transformed 
by the square root method in order to 
obtain homogeneity of the variance. 
This analysis confirmed the signifi- 
cance of the results with reference to 
the knowledge variable, the F ratio 


TABLE 1 


ANALYSIS OF VARIANCE FOR THE PooLeD 
TRANSFORMED Data BasepD on 
REAcTION TIME TO THE 
First Worp SEEN 











Source df Mean mend F 
Taboo 1 .025 ~— 
Experiment = .100 — 
Knowledge 2 2.705 4.73* 
TXK 2 3.370 5.89** 
TXE 1 .126 -— 
EXK 2 352 - 
TXKXE 2 .014 — 
Error 48 572 














* Significant at the .05 level. 
** Significant at the .01 level. 
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being significant between the .05 and 
.O1 levels of confidence. The critical 
interaction of Taboo vs. Knowledge, 
significant beyond the .01 level, is 
further evidence against the applica- 
bility of the frequency explanation 
under these conditions, and favors the 
belief that the character of S’s expec- 
tations determines whether he is alert 
or defensive. No other of the sources 
of variation approached significance. 

Incidental to these main findings is 
the support for McGinnies obtained 
from a comparison of the differences 
between the No Knowledge cells of the 
pooled data. The t of 3.53 with 18 df 
lies beyond the .O1 level. 

The protocols were next exam- 
ined with reference to the prerecogni- 
tion guesses. Although guessing was 
strongly encouraged, only 36 Ss made 
guesses to the first word they saw. 
The 85 different guesses, tabulated 
according to McGinnies’ criteria, are 
summarized on Table 2. The x? of 
10.02, significant beyond the .01 level, 
supports McGinnies’ finding that his 
“observers made proportionately more 
similar and part responses to neutral 
and proportionacely more unlike and 
nonsense hypotheses to the critical 
words” (11, p. 248). 

In this connection a tabulation of 
“guessers” and “nonguessers” for 
taboo and nontaboo conditions was 
made to see if mere guessing were asso- 
ciated with either condition. While 
an interesting trend toward propor- 
tionately more guessers in the non- 
taboo cells was noted, it cannot be 
considered reliable since the x? was 
not significant. Of the 29 guesses 
which could be classified as meaning- 
ful words in response to nontaboo 
stimuli, only one was a taboo word 
used in our list. Of the 34 guesses 
which were meaningful words in 
response to taboo stimuli, only 4 were 
taboo. 











PERCEPTUAL DEFENSE AND ALERTNESS 173 


TABLE 2 


PrerEcOGNITION GuEssEs, First Worp 




















Class Taboo Nontaboo Total 

Similar 
24 39 63 

Part 
Nonsense 

& 17 5 22 
Unlike 
Total 41 44 85 











x? = 10.02, significant at the .01 level. 


Habituation effects —It will be re- 
membered that each S responded to 
each of five words. The preceding 
analysis has to do only with the reac- 
tion times to the first word S saw. A 
similar analysis was conducted on the 
second word seen by each S, and on 
each of the succeeding words. This 
resulted in four additional analyses of 
variance; in each of these the same 
statistical hypotheses could be tested 
as were tested in the analysis of the 
times to the initial word. Without 
exception the F’s resulting from these 
analyses were insignificant. For this 
reason we do not reproduce the values 
here. Since no significant differences 
of any kind (first order or higher) were 
found in the analyses of words other 
than the first, we must conclude that 
whatever factors producing the de- 
layed times for taboo words exhibited 
without warning and the quickened 
times of taboo words with warning 
were substantially eliminated on the 
basis of a very small number of expo- 
sures. The way in which reaction 
times to taboo words seen with and 
without warning converge upon reac- 
tion times to nontaboo words with 
successive exposure is currently under 
investigation in this laboratory. 


Discussion 


The present experiment grew out of 
work which initially has been theo- 
retically explained by the notion of a 
perceptual defense. Competing with 
this conception is the explanation 
offered by Howes and Solomon (5) 
which supposed that the slower times 
for taboo words is due to the lesser 
frequency with which such words 
appear in visual experience. The 
present experiment enlarges the evi- 
dence with which these competing 
theories must deal. It is now neces- 
sary, not only to account for the 
longer reactions for taboo words seen 
under McGinnies’ conditions, but also 
for the shorter reaction times seen 
under our “warning” condition. In 
addition the competing theories must 
deal with the fact that differences in 
recognition thresholds involving taboo 
and nontaboo words disappear with 
habituation in experiments of this sort. 
The Howes-Solomon theory can ac- 
count for the slowness with which 
taboo words are seen without warning, 
but it cannot explain the fact that 
such words are seen more quickly than 
nontaboo words when expectation 
exists. Further the Howes-Solomon 
theory offers no grounds for explain- 
ing the habituation effects described 
above. 

Looked at in the context of Mc- 
Ginnies’ original theory the results 
suggest very strongly the bipolar 
character of the personal conditions 
which affect the speed with which per- 
ceptions are reported. In a sense, 
the experiment is a test of Bruner and 
Postman’s (1,2) views that these per- 
sonal conditions are of such a nature 
that at some times they operate to 
defend against stimuli and at others to 
promote alertness to the reception of 
stimuli, whether defensiveness or alert- 
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ness occurs depending upon the need 
prevailing in the receiver. 

The difficulties inherent in the 
notion of perceptual defense have been 
presented cogently by Luchins (9) and 
Howie (7). Here we wish merely to 
note that our results might more par- 
simoniously be fitted into the theory 
of set. Woodworth (16) recognized 
the importance of immediate past 
experience as a factor in both percep- 
tion and reaction time. We believe 
that the differing instructions in the 
present experiment are analogous to 
the situations he describes and served 
to change the relative dominance of 
certain sets in our Ss. 


SUMMARY 


Using a carefully calibrated tachis- 
toscopic method of presentation, 15 
obscene taboo words and 15 infre- 
quently encountered nontaboo words 
were presented to university graduate 
students. Recognition thresholds were 
determined in an effort to test the 
hypothesis that the speed with which 
words are reported depends on the 
nature of S’s expectations at the time 
of presentation and that this depend- 
ence is more marked in the case of 
taboo than nontaboo words. In an 
initial study of 30 Ss and in a repli- 
cation using 30 different Ss, conditions 
were developed such that the taboo 
words were less easily recognized than 
the nontaboo words when S had no 
reason to expect to see emotion-arous- 
ing words but that when such an expec- 
tation existed, the taboo words were 
the more easily recognized. 

These results, obtained under con- 
ditions where frequency can scarcely 
be said to be an important parameter, 
tend to corroborate the initial findings 
of McGinnies. They indicate further 
that the theoretical explanation of this 
and similar studies must account for 


the importance of very recently 
acquired changes in expectations. 


(Received August 11, 1952) 
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TRANSMISSION OF INFORMATION CONCERNING 
CONCEPTS THROUGH POSITIVE AND 
NEGATIVE INSTANCES! 


CARL I. HOVLAND AND WALTER WEISS? 
Yale University 


In the widely known conceptual 
learning studies of Smoke (6, 7) an 
attempt was made to determine the 
contributions of “positive instances” 
(where the required characteristics of 
the concept are included in the stim- 
ulus figure) and “negative instances” 
(where one or more of the required 
characteristics is absent) in the acqui- 
sition of concepts. The conclusion 
was reached that negative instances 
do little to facilitate the learning of 
concepts. No data, however, are pre- 
sented as to what information was 
transmitted by the negative instances. 
Thus one cannot tell whether the low 
efficiency of the negative instances is 
attributable to the fact that they com- 
municated little information beyond 
that already conveyed by the positive 
instances, or is caused by greater diffi- 
culty in assimilating information 
transmitted in the form of negative 
instances. 

It is difficult to separate the effects 
of the two alternatives just mentioned 
when one employs the unstructured 
type of situation used by Smoke, since 
the information conveyed by each 
type of instance cannot be assessed 
accurately in the absence of knowledge 
of the hypotheses considered by S. 
Accordingly, for experimentation on 


1 This research was carried out as part of the 
Yale Communication Research Program, which 
is under the senior author’s general direction. 
Grateful‘acknowledgment is made to the Rocke- 
feller Foundation for support of this work. 

The writers wish to thank Professors J. C. R. 
Licklider, B. J. Underwood, and F. D. Sheffield 
for helpful criticisms and suggestions. 

2 Now at Boston University. 


this problem it is preferable to use a 
situation where S knows the structure 
of the concept and the dimensions 
involved. One can then determine 
how many of each type of instance are 
required from a logical standpoint to 
specify completely the characteristics 
of the concept. 

A mathematical analysis of the 
number of positive and negative in- 
stances required to communicate 
concepts under various specified con- 
ditions has been made by Hovland 
(2). The theoretical model he ana- 
lyzes is one in which each concept 
involves a combination of certain spe- 
cific values along two or more stimulus 
dimensions. Form, color, quantity, 
and size are examples of “dimensions” 
and within each dimension there may 
be a number of “values,” e.g., red, 
green, and yellow within the color 
dimension. As an example, one can 
transmit a concept through successive 
presentations of Weigl-type cards. 
The S will be told that the possible 
stimulus dimensions will be form, 
color, and size and that there will be 
three possible values of each. If he 
is also told that the concept will con- 
sist of a combination of one value of 
each of two dimensions [giving con- 
cepts such as that of a “black square”’ ] 
it will take a minimum of two positive 
instances or of ten negative instances 
to define the correct combination. If 
positive instances are used and the 
first card contains a medium-sized red 
triangle and the second contains a 
medium-sized green triangle, S could 
deduce that the concept is a medium- 
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sized triangle (of any color). Alter- 
natively the same concept can be com- 
pletely defined by stating that the 
following are incorrect (“negative 
instances’’): small red square, medium 
green circle, small green triangle, large 
red circle, medium yellow square, large 
red triangle, large green square, me- 
dium red circle, small yellow circle, 
large yellow triangle. These ten 
instances rule out all other combina- 
tions of single values of two dimen- 
sions. Knowing that the correct con- 
cept can be transmitted by either two 
positive or ten negative instances per- 
mits one to raise the experimental 
question as to whether S will derive 
the correct answer with equal ease 
from the two sets of instances. To 
obtain information on this point is one 
of the principal purposes of the present 
set of experiments. 

Additional problems are raised con- 
cerning the transmission of concepts 
by combinations of positive and nega- 
tive instances. From the standpoint 
of the information transmitted a posi- 
tive instance followed by two negative 
instances conveys the same amount of 
information concerning the character- 
istics of the concept as two negative 
instances followed by a positive. But 
from the standpoint of the assimila- 
tion of the information by S it is quite 
possible that the two sets have differ- 
ent effectiveness. One reason for 
hypothesizing a difference is that in a 
mixed positive and negative series the 
effect of positive instances is to greatly 
reduce the number of hypotheses which 
must be considered, while negative 
instances specify which of the alterna- 
tivescan be discarded. If thepositives 
come first, S would only have to keep 
in mind a limited number of possi- 
bilities; whereas, when the negatives 
come first, only a few possible hypoth- 
eses are eliminated and therefore S 
must retain quite a few alternatives 


until the positive instance finally 
defines the correct choice. Thus it 
might be expected that the former 
arrangement would be superior to the 
latter in ease of concept attainment. 

For both types of problem, knowl- 
edge of the information transmitted 
by positive and negative instances 
enables one to hold the amount of 
information communicated constant 
in order to study experimentally the 
“‘psychological” factors affecting the 
manner in which information is assim- 
ilated. Differences in difficulty 
of learning concepts from negative 
instances as compared with positive 
instances can then be attributed to 
differences in the nature of the psycho- 
logical processes rather than to differ- 
ences in the amount of information 
about the concept conveyed by the 
two types of instances. 


EXPERIMENT I 


Purpose.—This experiment was de- 
signed primarily to compare the learn- 
ing of concepts when the necessary 
information is conveyed by series of 
instances which are exclusively posi- 
tive or exclusively negative. The 
effect of the order of presentation of 
mixed positive and negative instances 
was also investigated. 


Procedure.—The S’s task was to derive the 
combination of characteristics defining a concept 
on the basis of stimulus figures which either in- 
cluded all the requisite characteristics (positive 
instances) or lacked one or more of the essen- 
tial characteristics (negative instances). Stim- 
ulus figures were constructed using the following 
dimensions: form, color, quantity, and size. 
The values of the form dimension were the 
figures X, H, O, and 8. Values of the color 
dimension were blue, red, and black. For the 
size dimension the following values were em- 
ployed: small (} in. high), medium (1 in.), and 
large (1$ in.). Finally, there were four values 
of the quantity dimension, 1, 2, 3, or 4 identical 
figures being used. In some of the series three 
dimensions were used, in others, four. From 
two to four values of each factor were employed 
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in any particular series. The numbers of dimen- 
sions and values for each series are specified in 
the tables below. 

The S was first instructed as to the meaning 
of the term “concept” as it would be used in the 
experiment. He was told before the presen- 
tation of each problem how many different 
dimensions he would have to consider, the num- 
ber of these which would be relevant, the num- 
ber of possible values for each dimension, and 
the number of values which would be correct 
for each relevant dimension (cf. 2). S was given 
some practice examples to solve, illustrative of 
each of the several ways of presenting the infor- 
mation needed to define the concept. The E 
would indicate any errors that had been made in 
the practice problems. 

The procedure for an individual S on a par- 
ticular series may be illustrated by an example: 
S would be told that three dimensions, color, 
form, and quantity, would be used; that two 
values of each would be involved: red or black 
for color, square or circle for form, and one or 
two for quantity; that the concept would consist 
of one value of each of two dimensions; that five 
cards showing examples of what the concept 
“was not” would be given. Card 1, containing 
one red square, was then exposed. Card 2 con- 
tained two black circles; Card 3, one black circle; 
Card 4, two red circles; Card 5, two red squares. 
[Correct answer—black square. ] 

Because of the likelihood that some dimen- 
sions would be more difficult than others, the 
dimensions and values for the various problems 
were rotated in such a way that for the experi- 
ment as a whole the all-positive and all-negative 
series involved the same dimensions and values. 
The order of positive and negative series was 
also counterbalanced. 

Twelve Yale College students served as Ss. 
Each came for two I-hr. sessions. The figures, 
printed on 3 X 5-in. cards, were exposed one at a 
time for 5 sec. on a table in front of S. Positive 





instance cards were labelled “Right” and nega- 
tive “Wrong.” For each series the minimum 
number of cards needed to specify completely 
the given concept was employed. After the set 
of cards for a particular concept was presented, 
S was given 20 sec. to give his answer. 
Additional cases were secured under group 
experiment conditions from a section of the 
elementary psychology course at Yale (V=95). 
Here the cards were flashed one at a time on a 
screen by means of an opaque projector. Each 
card was exposed for 5 sec., and 20 sec. was 
allowed at the end to write down the answer. 


Results.—The percentages of Ss able 
to define the concept correctly after 
being exposed to a series of positive or 
a series of negative instances are pre- 
sented in Table 1. The number of 
instances of either type was the mini- 
mum number needed logically to de- 
fine the concept. So in terms of the 
information presented, equally precise 
answers could be given in either case 
if Ss were able to absorb the presented 
information equally well. The results 
indicate that under the conditions 
studied it is considerably harder to 
define a concept on the basis of a series 
of instances showing what the concept 
“is not” than on the basis of a series 
of instances showing what the concept 
“is.” Chi-square tests show that the 
difference in percentage of Ss attaining 
the concept from the two types of 
instance is significant at better than 
the .001 level for all series. 


TABLE 1 


PERCENTAGE oF Ss Atrarntnc Correct Concert witn Aut-Positive vs. ALL-NEGATIVE 
Instances (SuccessivE PRESENTATION) 

















Stimulus Series Per Cent of Ss Attaining Concept 
Total No. lo N 
No. hy Dimen- vilnus tienes | tetenets All-Positive All-Negative Dy? 
Dimen- | values sions Correct | Required | Required Instances Instances 
sions * Relevant 
3 3 2 1 2 10 100.0 (N= 24) | 16.7 (VN=24) 1 
3 2 2 1 2 5 100.0 (V=24) | 16.7 (V=24) 1 
4 2 2 1 2 5 (a) 95.8(N=24)| 4.2 (N=24) 001 
(b) 94.7 (N=95) | 13.7 (N=95) l 
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Table 2 compares the ease of learn- 
ing concepts when the sequence is first 
positive and then negative as com- 
pared with that when the sequence is 
first negative and then positive. Series 
were used where one positive and two 
negative instances were required, so 
the sequences compared were positive- 
negative-negative vs. negative-nega- 
tive-positive. Both series are equal 
in the amount of information about 
the concept which they contain and 
completely specify the concept. The 
data provide no suggestion of any dif- 
ference in difficulty under the present 
circumstances, so the initial hypoth- 
esis that the series with positive in- 
stances first would be superior is not 
confirmed. 


EXPERIMENT II 


Purpose.—In the preceding experi- 
ment only two positive instances were 
required to specify the concepts, 
whereas either five or ten negative 
instances were required to define them 
with equal precision. One possible 
contributory factor in the obtained 
greater difficulty of learning concepts 
from negative instances than from 
positive instances might therefore be 
difficulty in remembering the longer 
list of items required for the former. 
Accordingly, the first experiment was 


repeated on a second sample of Ss 
with the modification that all of the in- 
stances in any series were exposed si- 
multaneously. Data were also secured 
on the effectiveness of combinations 
of positive and negative instances. 


Procedure.—The method paralleled closely 
that used in the first experiment except that the 
entire series of cards was exposed at once. 
Twelve Yale College students were employed as 
Ss. They were seen individually on two sepa- 
rate occasions of about l-hr. duration each, 
separated by at least a day. 


Results.—A comparison of the num- 
ber of Ss attaining concepts correctly 
when they are transmitted by an all- 
positive or an all-negative series is 
shown in Table 3. As in the case of 
successive presentation there is a con- 
sistent superiority for the all-positive 
series, although this set contains the 
same amount of information about the 
concept as the all-negativeone. When 
the results for all three series are com- 
bined, the difference is significant at 
better than the .001 level (x? = 16.3, 
df = 3). However, it will be seen 
that the percentage of Ss able to 
attain the concept from the all-nega- 
tive series is substantially higher than 
the number under the equivalent suc- 
cessive conditions (Table 1). This 
supports the prediction of Underwood 
(8), which he based on contiguity con- 
siderations. With positive instances 


TABLE 2 


PerceNTAGE oF Ss Atratninc Correct Concept wITH VARYING SEQUENCES OF 
Positive AND NEGATIVE INSTANCES 
(SuccesstvE PRESENTATION) 








Stimulus Series 


Per Cent of Ss Attaining Concept 




















Total Total _ No. No. Positive, Negative Negative, 
Dimensions Values yy Convent segue Paitive oe 
4 2 2 1 37.5 (N=24) 45.8 (V=24) SD 
3 4* 2 1 (a) 50.0 (N=24) |(a) 45.8 (N=24) .80 
(6) 21.1(N=95) |(6) 22.1 (N=95) 85 








* Except for color, where only three values were used. 
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TABLE 3 


PercentaGce or Ss Atraintnc Correct Concept wit Aui-Positive anp ALL-NEGATIVE 
Instances (SimuULTANEOUS PRESENTATION) 
N= 12 








Stimulus Series 


Per Cent of Ss Attaining Concept 








Total Total No. No. 
No. No. Dimensions} Values 
Dimensions} Values 


No. Pos. 
Instances | Instances 


No. Neg. | All-Positive | All-Negative 
Instances Instances Px? 








Relevant Correct Required | Required 
3 3 2 1 2 10 100.0 50.0 <.02 
3 2 2 1 2 5 100.0 83.3 0.12 
} 2 2 1 2 5 100.0 66.7 0.10 


























the number of Ss answering correctly 
is so high even with successive presen- 
tations that no marked difference is 
obtained between the two temporal 
arrangements. 

Table 4 presents a comparison of 
the percentage of Ss attaining the 
correct concept when two negative 
instances are substituted for one of 
the positive instances (giving one 
positive and two negative instances), 
as compared with a series in which 
only the two positive instances are 
employed. In none of the compari- 
sons are as many Ss able to reach the 
concept when mixed positives and 
negatives are used as when only posi- 
tive instances are employed. In the 


easier series with one value correct, 
the differences are not significant, but 
significant differences do obtain for 
the more difficult series with two 
values for each dimension correct. 
When the results for the two more 
difficult series are combined, the dif- 
ference between the all-positive and 
mixed positive and negative results is 
significant at better than the .01 level 
(x? = 11.2, df = 2). 


EXPERIMENT III 


Purpose.—The results of Exp. II 
indicate that although the difference 
between positive and negative in- 
stances is reduced when memory 


TABLE 4 


PercenTaAGE or Ss Atratntnc Correct Concept witn Aui-Positive as CoMPARED WITH 
Mrxep Positive anv Necative Instances (S1ImuULTANEOUS PRESENTATION) 

















N= 12 
Stimulus Series Per Cent of Ss Attaining Concept 
Total Total No. No. Two One Positive; 
No. No. Dimensions Values iti tee py? 
Dineustens Values Relevant Correct Positives Two Negatives 7 
3 = 2 1 100.0 83.3 12 
4 Tt 2 1 100.0 91.7 30 
Three Positives| Two Positive; 
Two Negatives 
3 . 2 2 91.7 50.0 07 
4 T 2 2 75.0 33.0 10 


























* Four for quantity and form, three for color. 
+ Four for quantity and form, three for color and size. 
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factors are minimized by simultaneous 
presentation, a significant superiority 
for the positive instances remains. 
But comparison of the number of 
instances suggests another possible 
factor: In both Exp. I and II more 
negative than positive instances were 
necessary to completely convey the 
concept. The greater difficulty of 
negative instances might be due to the 
fact that it is more difficult to assimi- 
late the larger number of instances. 
By appropriate selection of cases it is 
possible to transmit concepts by an 
equal number of positive, negative, 
and mixed instances. In this experi- 
ment, therefore, series were employed 
in which the same amount of infor- 
mation was conveyed and the same 
number of instances were used for all 
positive, all negative, and mixed posi- 
tive and negative series. 


Procedure—The general method employed 
was very similar to that used in Exp. II. Again, 
simultaneous exposure of all the instances needed 
for a particular problem was used. The prob- 
lems, however, were selected in such a way that 
four instances would completely specify the con- 
cept for each type of problem (2). Three types 
of series were used, involving four positive, four 
negative, or two positive and two negative 
instances. 

Instead of a single set for each concept as in 
the previous experiments three different sets 


were prepared, so that three “trials” were per- 
mitted. The first set of four was exposed and S 
attempted an answer. If he did not get the 
concept correctly, a second set of four different 
instances of the same concept was presented. If 
he failed on the second set, a third set of instances 
was shown. The time limit on each presenta- 
tion was 3 min. 

For this experiment a newly developed series 
of flower designs (3) was employed instead of the 
Weigl-type cards used in the two other experi- 
ments. The flower designs allowed the use of 
four values of each of four dimensions: four types 
of blossom were employed; each could be colored 
red, yellow, blue, or white; one of four differently 
shaped leaves could be used; and from one to 
four identical leaves were placed on the stem. 
In one series of problems all four dimensions 
were included, in the other, three. Counter- 
balancing to rotate the dimensions and values for 
the three conditions was employed as in the 
earlier experiments. In this way any differences 
between the groups could not be attributed to 
differences in the difficulty of the dimensions 
involved. 

Twenty-four Yale College Ss were employed; 
none of them had taken part in any of the earlier 
experiments. They were seen individually in a 
session lasting about 1 hr. 


Results —Table 5 presents the cum- 
ulative percentages of Ss able to 
specify the concept correctly after one, 
two, and three presentations of the 
all-positive, all-negative, and mixed 
positive and negative series of in- 
stances. For both the three- and four- 
factor series there is a clear difference 


TABLE 5 


Concert ATTAINMENT ON First, SEconpD AND Tutrp PresENTATION oF ALL-PosiTIvE, 
Autit-NEGaTIvE, AND Mixep Positive anp NEGATIVE INsTANCE SERIES 

















N=24 

Stimulus Series Cumulative Per Cent of Ss Attaining Concept 

LS gy LS my - No. oa 2 -_ All ow All 

‘ » i ues resentati cot , 
Dimenstons Values Relevant . Correct manen| Positive Negative Negative 

1 79.2 54.2 20.8 

3 4 2 3 2 95.8 70.8 41.7 

3 100.0 79.2 54.2 

1 91.7 37.5 12.5 

4 4 2 3 95.8 45.8 33.3 

3 95.8 58.3 50.0 
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between the all-positive and all-nega- 
tive series in the percentage of Ss 
attaining the concept after one, two, 
and three presentations of the series 
(each py2 = .001 or less). When the 
results for the three- and four-factor 
series are combined, there is a signifi- 
cant difference on the first trial 
between the combined positive and 
negative and the all-positive series 
(px2 = 001) and between the com- 
bined and all-negative series (p = .01). 
On the third trial the difference be- 
tween the positive and the mixed 
series is still significant (py2 = <.001) 
but that between the all-negative and 
mixed series is no longer sizeable 
(Px2 7 .13). 

In a recent article Underwood (8) 
has suggested that one would expect 
that concepts will be learned more 
slowly when a greater number of 
stimuli are involved. A test of this 
can be made by comparing series 
where four dimensions are involved 
with the series where there are only 
three. The same number of values 
was employed and the same number 
of instances was required. In seven 
of the nine possible comparisons learn- 
ing is more rapid for the three-dimen- 
sion series than for the four-dimension 
series, but the tests are not completely 
independent. The likelihood that the 
number of dimensions is important, 
however, is increased by the fact that 
similar results were obtained in the 
other two experiments, particularly 
for the negative series (Tables 1 and 
3). Asimilar conclusion can be drawn 
from the experiment of Reed (5). 


Discussion 


While a machine could be con- 
structed which would arrive at the 
correct concept with equal ease on the 
basis of the positive or negative 
instances, the results of the present 


experiment clearly indicate that the 
human organism does not operate 
similarly on a strict probability basis. 
For all of the conditions examined, 
more Ss arrive at the correct concept 
when the instances are positive than 
when they are negative, even though 
each series transmits the same amount 
of information. At the same time, 
the data disprove the generalization 
often cited that negative instances 
have no value in the learning of con- 
cepts. Under appropriate conditions 
over half of the Ss were able to reach 
the correct solution solely on the basis 
of negative instances. 

A major research task remains, how- 
ever, that of determining the factors 
responsible for the greater effective- 
ness of positive instances. Perceptual 
factors undoubtedly play some role in 
the explanation. Positive instances 
have the required characteristics di- 
rectly perceptible. Negative in- 
stances, on the other hand, do not 
have the correct combination of char- 
acteristics in direct view. Closely 
related is a difference in the charac- 
teristics of the correct concept and the 
one delineated by the negative in- 
stances. The positive instances define 
a coherent, well-structured combina- 
tion of characteristics of the concept, 
while the negative instances are in 
effect positive instances of what the 
concept “is not.” They thus lack an 
organized, unitary quality, except for 
the limiting situation where only a 
single dimension is involved. 

A number of limitations of the pres- 
ent research must be mentioned. 
First, it is based on only one of a num- 
ber of possible concept models. Sec- 
ond, it is concerned only with concept 
formation, and does not consider con- 
cept generalization, i.e., applying the 
learned concept to new examples. 
Finally, it covers only how a single 
concept is learned, and does not treat 
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the frequent situation where a positive 
instance for one concept serves simul- 
taneously as a negative instance for 
other concepts in the list (e.g. 4). 
Here additional factors, such as stim- 
ulus generalization, are needed to 
account for interactions between items 
(1). 


SUMMARY 


Smoke (6, 7) reported that negative 
instances are rather ineffective in the 
learning of concepts. The question 
arises as to whether this is because 
negative instances convey little infor- 
mation or because it is difficult for Ss 
to assimilate material presented in 
this form. In the present experi- 
ments the amount of information 
about a concept was equated for the 
two types of instance, so that any 
differences in learning the concepts 
could be attributed to differences in 
difficulty of assimilating information 
concerning what the concept “‘is” as 
compared with assimilating informa- 
tion concerning what it “is not.” 
Data derived from the analysis of 
Hovland (2) were used to determine 
the minimum number of positive and 
negative instances needed to specify 
completely any particular concept. 

In the first experiment Ss were 
shown successively a series of Weigl- 
type cards conveying a concept by 
all-positive instances or by all-nega- 
tive instances. In the second experi- 
ment equivalent series of all-positive 
and all-negative instances were pre- 
sented simultaneously, to minimize 
possible memory effects. In the third 
experiment series of flower design 
cards (3) were selected in such a way 
that transmission of the concept 
required an identical number of posi- 
tive, negative, or mixed positive and 
negative instances. 

Under the experimental conditions 
employed the results indicate: 
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a. The correct concept is attained 
by a higher percentage of Ss when 
transmitted by all-positive instances 
than by all-negative instances. 

b. Mixed positive and negative in- 
stances are intermediate between all- 
positive and all-negative series in 
difficulty of learning. 

c. When the negative instances are 
displayed simultaneously, the accu- 
racy of concept attainment is higher 
than when they are _ presented 
successively. 


The all-negative instances are thus 
shown to be consistently inferior to 
all-positive. At the same time, the 
results disprove the generalization 
that concepts cannot be learned from 
negative instances, since under appro- 
priate conditions over half of the Ss 
were able to arrive at the correct con- 
cept exclusively on the basis of nega- 
tive instances. 


(Received August 21, 1952) 
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INCENTIVE, ANXIETY, AND THE HUMAN BLINK RATE! 


DONALD R. MEYER, HARRY P. BAHRICK, AND PAUL M. FITTS 
The Ohio State University 


This experiment tests the hypothesis 
that the introduction of an incentive 
for performance of a visually guided 
task will bring about an increased fre- 
quency of blinking during the interval 
between trials. The change is pre- 
dicted by a theory of response inter- 
action. developed by Meyer (8). Ac- 
cording to the theory, two or more 
simultaneous responses are interre- 
lated as a function of the degree of 
overlap between the activated motor 
pathways. From this it follows that 
a given response is most readily 
affected by responses which utilize 
neighboring central nervous channels. 
Since the motor mechanism for the 
eyelid is strategically placed between 
the pathways for the face and hand 
(9), almost every response that a man 
can make should alter his tendency to 
blink. No attempt is made to account 
for the fact that he blinks in the first 
place. 

The amount of alteration should be 
a function of the spatial distribution 
of activity in the adjacent pathways, 
on the assumption that degree of over- 
lap is slight if activity is concen- 
trated. This distribution can be de- 
termined indirectly by measurements 
of muscular tension patterns. It has 
been established that tension is typi- 
cally generalized under conditions of 
performance which augment task diffi- 
culty. Lack of skill, distractions, 


1 This research was supported in part by the 
United States Air Force under Contract No. AF 
33(038)-10528 monitored by the Perceptual and 
Motor Skills Research Laboratory, Human Re- 
sources Research Center. Permission is granted 
for reproduction, translation, publication, use 
and disposal in whole and in part by or for the 
United States Government. 


complex materials, and long work 
periods all are contributing factors. 
Increased incentive has a_ similar 
effect, and this is understandable in 
view of the fact that a high level of 
performance is more difficult to main- 
tain than a low. In summary, then, 
it should be possible to alter with 
incentive the pattern of activity in a 
response, and to detect this change by 
appropriate examination of the blink 
rate. 

Indirect support for this analysis is 
provided by the results of Clites (4) 
and of Telford and Storlie (12). Some 
opposition can be found in studies of 
the blink rate as an index of the cost 
of visual work. The arguments pro 
and con on this point-fave been sum- 
marized by Luckiesh (7) and Bitter- 
man (2). Meyer has pointed out that 
the conflicting data which led to this 
controversy are understandable if it is 
recognized that blinking is attenuated 
when performance is under close visual 
control. Thus two antagonistic pro- 
cesses operate in the visual task situa- 
tion, and for this reason measurements 
in the present experiment are taken 
after performance is complete. That 
effects of simultaneous activities can 
be demonstrated under these condi- 
tions is attributed to the fact that 
relaxation takes time. 

A test is also provided for the hy- 
pothesis that individuals who score 
high on inventories of “anxiety” have 
high blink rates as well. Taylor (11) 
has reported that Ss show a greater 
frequency of blink CR’s if they belong 
to a group chosen for high “manifest 
anxiety,” while Hilgard, Jones, and 
Kaplan (6) have shown that such indi- 
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viduals form discriminative CR’s less 
readily than Ss who score low on the 
screening questionnaire. Meyer (8) 
has attributed these results to facili- 
tation of the eyeblink by the simul- 
taneous responses which make up the 
characteristically high level of mus- 
cular activity found in “anxious” Ss. 
These same responses should have an 
effect upon the blink rate, for the 
theory of interaction gives a similar 
account for modifications of latency 
and magnitude of single and repetitive 
responses. 


METHOD 


Subjects—The Ss for this experiment were 
52 men and 13 women from elementary courses 
in psychology and engineering. All had normal 
or corrected-to-normal vision. Ages ranged 
from 17 to 39, with a median of 23. 

Apparatus—An adjustable chair for S was 
placed in front of a table which supported the 
task apparatus and a three-sided, roofed cubicle 
constructed from acoustic wallboard. Footrests 
were provided, and a panel mounted on the ceil- 
ing of the cubicle served as a headrest. The 
interior of the cubicle was illuminated by a 60-w. 
bulb attached to the opposite side of the head- 
rest panel. A one-way vision screen covered an 
aperture cut into the near wall of the cubicle so 
that a concealed E could observe the eyelids 
of S. 

The central problem for S was a pursuit task 
provided by the Buxton Division-of-Attention 
Apparatus (3). There were a number of addi- 
tional devices which provided peripheral tasks 
as well. One of these was a semicircular array 
of four lights which could be turned out by 
switches manipulated with the left hand. A 
second peripheral task was devised from the 
Grether Dual-Pursuit Apparatus (5). It con- 
sisted of a dial whose pointer would move away 
from a 12 o’clock, correct position. Such devia- 
tions could be corrected by S by rotation of a 
control knob with the left hand. Tonal stimuli 
were also presented from time to time, and these 
were generated by a Jackson Model 655 audio 
oscillator. The arrangement of tasks was de- 
signed to test implications of a theory of percep- 
tual restriction. The details of this aspect of 
the study are reported separately (1). 

Procedure.—Each S was first given two paper- 
and-pencil personality tests. One was made up 
of items related to tension which were taken 
from the Taylor inventory (11), and a number of 
others related to motivation. The other was 


the Rotter Sentence Completion Test (10), which 
was subsequently scored for maladjustment by 
an expert clinician. 

At the beginning of the experiment proper, S 
was seated at the apparatus and told to maintain 
the same posture throughout the series of trials. 
Formal instructions for the operation of the 
Buxton apparatus followed. Further instruc- 
tions were given a trial at a time. There were 
14 trials in all, each 1 min. in length and followed 
by a 30-sec. rest interval. The special conditions 
for the trial were as follows: 

Trials 1-2. All Ss practiced the Buxton task 
alone. The E emphasized that the scores did 
not count, that S should just relax and familiar- 
ize himself with the task and procedure. 

Trial 3. The Ss were divided into two com- 
parable groups. Those in a high-low incentive 
group were now told that performance counted, 
and that bonuses would be given which ranged 
from 5 cents to $1.50 depending on performance. 
The schedule was arranged so that the bonuses 
earned were usually 5 or 10 cents on this trial. 
The Ss of a low-high incentive group, who were 
to receive their chance at a bonus in the later 
trials, were given no additional instructions other 
than that this was another practice trial. Inci- 
dental tones and lights were presented by £ to 
Ss of both groups, who were subsequently ques- 
tioned about them as a part of the perceptual 
restriction experiment. 

Trial 4. All Ss were told that this was a 
practice trial, that scores did not count. Now, 
however, they were instructed to watch for lights 
which would appear while they were performing 
the pursuit task. At such times they were to 
turn off the lights with the switches provided. 
It was stressed that the two tasks were of equal 
importance. 

Trials 5-6. The performance required of all 
Ss was the same as for the preceding trial, but 
now the individuals of the high-low incentive 
group were offered a money bonus for superior 
performance. The schedule ranged from 5 cents 
to $3.00, and most Ss earned from 10 to 15 cents. 
The low-high group, on the other hand, was told 
that practice trials were to continue. 

Trial 7. All Ss were now informed that the 
task was somewhat different, that in addition to 
the pursuit task they were to watch for jumps of 
the pointer on the Grether dial. Deviations 
were to be corrected with the knob. It was 
stressed that the scores did not count, and that 
the tasks were of equal importance. 

Trials 8-9. Now the high-low incentive group 
was given another chance at a bonus for the per- 
formance practiced on Trial 7, according to the 
5 cents to $3.00 schedule. Most Ss earned about 
25 cents. The low-high incentive group was told 
that practice trials would continue. 
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Trials 10-11. At this point the incentive con- 
ditions for the two groups were reversed. The 
high-low Ss were told that no more bonus could 
be earned, that trials which followed were prac- 
tice trials. The task was the same as that for 
Trials 5-6, and Ss of the low-high group were 
offered the bonus. 

Trials 12-13. The conditions were the same 
as those for Trials 8-9, but the individuals of the 
low-high group were given the bonus formerly 
offered to the high-low Ss. 

Trial 14. The low-high group received a 
bonus for pursuit task performance, and faint 
lights and tones were presented. 


REsULTS 


The average number of blinks re- 
corded for each group in the 30-sec. 
rest period which followed each trial 
is shown in Fig. 1. The scores for 
both groups are essentially the same 
at the beginning, but there is a sharp 
increase in the mean blink rate of the 
Ss in the high-low group when the first 
bonus is offered on Trial 3. On Trial 


4, which is a practice session with no 
bonus, the difference between the 
groups disappears. It is evident again 
on Trials 5 and 6, which are bonus 
sessions for the high-low group. The 
increased blink rate is maintained on 
Trial 7, even though no bonus is 
offered, which suggests the persistence 
of an effect built up by two bonus 
trials in succession. There is also a 
relatively slow decay of the blink rate 
for the high-low group after the final 
bonuses are given on Trials 8 and 9. 
Statistical tests of the difference 
between the groups on Trials 3, 5, 6, 
8, and 9 combined yield a ¢ of 3.24, 
which is significant at the .01 level. 
The introduction of a bonus for the 
low-high group on the last four trials 
gives only a slight increment in the 
blink rate for these Ss. The change 
is of dubious statistical significance. 
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Fic. 1. Mean number of blinks during the 30-sec. period following each trial. Incentive conditions 
for the two groups were reversed on Trial 10. 
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The ¢t for a comparison of blinks on 
bonus and no-bonus trials within the 
low-high group is 1.81, which is sig- 
nificant at the .05 level. A similar 
test for the high-low group data gives 
a t of 4.42, significant at the .001 level, 
which indicates that the effect of an 
incentive is more pronounced when it 
is offered early in learning. 

There is a product-moment corre- 
lation of .36 (significant at the .05 
level) between over-all frequencies of 
blinking and Rotter maladjustment 
scores for individuals in the high-low 
group. No relation is demonstrable 
between the same scores for individ- 
uals in the low-high group, r = .00. 
There is also no significant correlation 
between blink rates and scores on the 
tension-motivation inventory. The 
correlation between blink rates and 
performance scores is —.02 for high- 
low Ss, but is .45 for low-high Ss. The 
latter is significant at the .01 level. 


Discussion 


The results of this experiment sub- 
stantiate the hypothesis that increased 
incentive for performance will bring 
about an increased frequency of blink- 
ing if measurements are taken when 
visual cues are not being utilized in 
the execution of the task. The most 
pronounced effect is found if the in- 
centive is introduced early in the 
practice series. This effect is attrib- 
uted on theoretical grounds to an 
increased distribution of activity in 
central motor pathways that is re- 
flected peripherally as generalized 
muscular tension. The data, then, 
support the contention of Meyer (8) 
that the blink rate can be a useful 
index of changes in distribution of 
muscular activity. 

It can be objected that, if blink rate 
is an index of tension, there should be 
a systematic change as a function of 


trials. None has been observed in 
this experiment. There are several 
possible reasons for this that have not 
been specifically explored. For ex- 
ample, although distribution should 
increase as a function of the length of 
the work period, it should also decrease 
as a function of acquisition. These 
processes cancel one another unless 
special procedures are provided for 
their identification. 

Another objection is that distribu- 
tion should increase as a function of 
the complexity of the task, which cer- 
tainly varies from trial to trial in this 
experiment. It should be noted, how- 
ever, that the complexity need not 
augment difficulty. Blink rates did 
not go up on practice trials with the 
complex performances, but it was 
emphasized that scores did not count 
and performance on the central task 
typically deteriorated under these cir- 
cumstances. Complexity, then, is im- 
portant only if there is incentive to 
overcome it. 

The outcome of the analysis of 
interrelations between blink scores 
and data from personality inventories 
was, in general, negative. We had 
anticipated a correlation between the 
rate of blinking and scores from one 
scale in particular, the anxietyinven- 
tory used by Taylor (11) and by 
Hilgard, Jones, and Kaplan (6). The 
negative results seem to be attribut- 
able to insensitivity. Taylor selected 
subjects who made extreme scores for 
her study of the relation between 
“manifest anxiety” and rate of con- 
ditioning, and her marked effects were 
not confirmed by Hilgard, Jones, and 
Kaplan. The latter investigators 
simply ran a correlation analogous to 
the one obtained in the present experi- 
ment, and found it to be low (.12) and 
insignificant. Perhaps the Rotter 
scale is thus a more sensitive indicator 
of distribution, for only the scores 
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from this instrument support the view 
that the extraneous muscular activity 
of “anxious” persons facilitates their 
blinking. 

SUMMARY 


This experiment was designed to 
test two hypotheses. The first of 
these is that increased incentive for 
the performance of a visually guided 
task will bring about an increased 
frequency of blinking during the inter- 
trial interval. The second is that 
individuals who score high on inven- 
tories of anxiety have high blink rates 
as well. Both are predictions from a 
general theory of response interaction. 

The 65 Ss were first given two per- 
sonality inventories, and then prac- 
ticed a pursuit task which was compli- 
cated at times by various peripheral 
tasks. At different points in practice 
monetary incentives were offered to 
some of the Ss. A concealed E re- 
corded blink rates during a 30-sec. 
rest period which followed each of the 
trials. 

Analysis of the data revealed statis- 
tically significant changes in the blink 
rate when incentives were introduced, 
and this effect was most marked in a 
group that received bonuses early in 
the practice series. The only signifi- 
cant correlation found between indi- 
vidual blink rates and personality 
scores was for a maladjustment index 
based upon the Rotter Sentence Com- 
pletion Test. A positive correlation 
was also found between performances 
and blink rates of individuals who 
were not given incentives until late in 
the practice series. 

It was concluded that, in general, 
the data supported the predictions and 


that the blink rate can be used as an 
index of generalized muscular tension. 


(Received August 7, 1952) 
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STIMULUS INFORMATION AS A DETERMINANT 


OF REACTION TIME! 


RAY HYMAN 
The Johns Hopkins Univoersity* 


In the typical reaction-time experi- 
ment, S’s reaction time is greater when 
he has to respond differentially to one 
of two equally probable stimuli in- 
stead of to just one stimulus. In fact, 
Merkel (2), using one to ten alterna- 
tives, has demonstrated that when S 
has to respond to one stimulus chosen 
from a number of equally probable 
alternatives, his reaction time increases 
with the number of alternatives. 

The fact that S’s response to stim- 
ulus 4 takes more time when 4 is one 
of several rather than one of two 
equally probable alternatives is of 
intrinsic interest. But it becomes 
even more significant when looked at 
from the standpoint of modern com- 
munication theory. In communica- 
tion theory the amount of information 
which a message conveys is an increas- 
ing function of the number of possible 
messages from which that particular 
message could have been selected.’ 
The S’s reaction time seems to behave, 


1 Based upon a thesis submitted to the Faculty 
of Philosophy, The Johns Hopkins University in 
partial fulfillment of the requirements for the 
degree of Doctor of Philosophy. The author 
wishes to express his gratitude to his adviser, 
Dr. W. R. Garner, and to Drs. A. Chapanis and 
H. W. Hake for assistance in the preparation of 
this paper. 

2 This research was done in cooperation with 
the Systems Coordination Division, Naval Re- 
search Laboratory under Contract N5-ori-166, 
Task Order I, between the Office of Naval 
Research and The Johns Hopkins University. 
This is Report No. 166-I-163, Project Designa- 
tion No. NR 507-470, under that contract. 

3 The amount of information conveyed by the 
selection of a particular message is defined as the 
logarithm to the base 2 of the reciprocal of the 
a priori probability that this particular message 
will be selected. 


under certain conditions, in a manner 
analogous to this definition of infor- 
mation. When a stimulus is chosen 
to which S must make a discrimina- 
tory response, his reaction time seems 
to be a monotonically increasing func- 
tion of the number of possible stimuli 
from which the stimulus can be chosen. 

Thus, the choice reaction-time ex- 
periment can be looked upon as a 
model of a communication system. 
The display represents a transmitter 
of information. Each alternative 
stimulus or signal represents a mes- 
sage; more information can be trans- 
mitted the greater the number of 
messages from which one can be 
chosen. The channel over which the 
signal is transmitted can be considered 
as the air space between the light and 
S, and might also include part of S’s 
visual afferent system. The S also 
acts as a receiver or decoder in that at 
some point he decodes the signal into 
its message and reacts with the appro- 
priate response (the destination of the 
information). 

Setting of the problem.—The experi- 
mental task involved varying the 
amount of information in the display 
and observing the corresponding 
changes in S’s reaction time to a stim- 
ulus presentation. The display was 
a matrix of lights, each light repre- 
senting a message. The S’s reaction 
time was registered by means of a 
voice key and timer. The average 
amount of information accompanying 
the presentation of a single stimulus 
was varied by (a) varying the number 
of equally probable alternatives from 
which a choice could be made, (d) 
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altering the probability of occurrence 
of particular choices, and (c) introduc- 
ing sequential dependencies between 
successive choices of alternatives. 

These three ways of varying the 
amount of information per stimulus 
have been incorporated into a single 
formula by Shannon in the formulation 
of his mathematical theory of com- 
munication (3). Such a unified form- 
ula represents a convenience for the 
communication engineer. From the 
psychologist’s viewpoint, however, the 
psychological equivalence of these 
three ways of varying information 
must be demonstrated empirically be- 
fore Shannon’s formula can be applied 
to the human component of the com- 
munication system. To demonstrate 
such an equivalence was one of the 
primary aims of the present study. 

Statement of the problem.—The par- 
ticular hypotheses investigated. ¢an be 
stated as follows: \ 

1. Reaction time is a monotonically 
increasing function of the amouart of 
information in the stimulus series, 

2. The regression of reaction time 
upon amount of information is the 
same whether the amount of informa- 
tion per stimulus is varied by altering 
the number of equally probable alter- 
natives, altering the relative frequency 
of occurrence of particular alterna- 
tives, or altering the sequential depen- 
dencies among occurrences of succes- 
sive stimuli. 

These hypotheses assume that (a) 
S’s responses are completely deter- 
mined by the stimulus series, and (b 
the occurrences of the successive 
stimuli do not alter S’s knowledge of 
the statistical properties of the stim- 
ulus series as a whole. 

The: first assumption demands a 
one-to-one correspondence between 
stimulus and response series. Hick’s 
experiment (1) indicates that S can 
decrease his reaction time to a given 


amount of information at the expense 
of an increased proportion of incorrect 
responses. The present study ex- 
cludes this possibility by demanding 
an errorless performance on the part 
of S. 

The second assumption excludes 
those situations wherein S gains new 
knowledge concerning the statistical 
structure of the stimulus series as the 
series progresses in time. The hy- 
potheses assume that S’s average 
uncertainty per stimulus presentation 
remains constant throughout a par- 
ticular stimulus series. The present 
experimental design helps to produce 
such a situation by (a) never present- 
ing S with the same series for a given 
condition more than once, (d) instruct- 
ing S concerning the statistical prop- 
erties of the series, and (c) giving S 
preliminary practice on a series of 
similar statistical construction. 


PROCEDURE 


Apparatus.—The apparatus consisted of a 
visual display, a voice key, a chronoscope, and a 
control panel. The display was a square matrix 
of 36 small lights. These 36 lights were arranged 
in six rows and six columns forming a square 3 in. 
on the side. The matrix was so placed that the 
length of a side of the display made a visual angle 
of 5° from where S sat. The diameter of each 
light in the display made a visual angle of 


, approximately 36’. 


Only eight of the lights were used: the four 
lights which made up the corners of the outer 
square of lights and the four corners of the next 
inner square of lights. Such a grouping seemed 
to minimize the confusions of one light with 
another on the matrix by S. The S designated 
each light by the distinctive names: Bun, Boo, 
Bee, Bore, By, Bix, Bevo, and Bate, respectively. 

A masonite screen separated EF, the control 
panel, and the chronoscope from S. The E pre- 
selected a light by means of a rotary switch. 
After giving S a warning signal, he closed a 
switch (2 sec. later) which simultaneously turned 
on the light and started the clock. The S’s 
vocal response stopped the clock by means of a 
throat microphone which activated an electronic 
voice key. 

Subjects.—The four Ss were male undergrad- 
uates, ages 18-22. Each S attended more than 
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40 experimental sessions over a 3-mo. period; 
approximately 15,000 reaction times were re- 
corded for each S. One of the Ss, F. K., also 
took part in 11 sessions of the pilot study during 
the month previous to the 3 mo. he served as S 
in the actual experiment. 

Experimental conditions—In all there. were 
three experiments, each making use of one of the 
three different methods of altering stimulus infor- 
mation. Each of these experiments consisted of 
eight specific conditions containing different 
amounts of stimulus information. In addition, 
five series of stimuli, each constructed separately 
but according to the same rules, were used for 
each experimental condition. 

Experiment I.—The first experiment had eight 
conditions which differed from each other only 
in terms of the number of equally probable alter- 
natives from which the stimulus could be chosen. 
The eight conditions involved numbers of alter- 
natives ranging from one through eight; the 
respective bits of information attached to each 
condition were 0.00, 1.00, 1.58, 2.00, 2.32, 2.58, 
2.81, and 3.00. This variable is essentially the 


same as used in the experiments of Merkel (2) - 


and Hick (1). 

Experiment II.—The second experiment had 
eight conditions which involved different num- 
bers of alternatives and different probabilities of 


TABLE 1 


Tue Eicut Conpitions ror Experiment II 
AND THE CORRESPONDING AMOUNTS OF 
INFORMATION IN Bits PER STIMULUS 











PRESENTATION 
Av. 
Number | Probability Amount 
Cond. | of Alter- of Log:1/p| of Infor- 
natives | Occurrence mation in 
ond. 
1 1 9/10 0.15 0.47 
1 1/10 3.32 
2 1 8/10 0.32 0.72 
1 2/10 2.32 
3 1 13/16 0.30 0.99 
3 1/16 4.00 
4 1 15/20 0.42 1.39 
5 1/20 4.32 
5 1 4/8 1.00 1.75 
1 2/8 2.00 
2 1/8 3.00 
6 1 5/10 1.00 2.16 
5 1/10 3.32 
7 1 8/16 1.00 2.38 
1 2/16 3.00 
6 1/16 4.00 
8 2 4/16 2.00 2.75 
2 2/16 3.00 
+ 1/16 4.00 

















the occurrence of these alternatives. Because of 
the greater complexity of these conditions, they 
are shown in Table 1. In Cond. 1, for example, 
there are just two alternatives, but one occurs 
nine times as often as the other. The stimulus 
information in bits is shown at the right for each 
component separately and for the weighted aver- 
age of the total condition. In Cond. 3 there is a 
total of four different alternatives, one of which 
occurs 13/16 of the time, with the other three 
each occurring 1/16 of the time. Conditions 7 
and 8 each involve the greatest number of differ- 
ent alternatives (eight), with the proportions of 
occurrences as shown. 

Experiment III.—In the third experiment dif- 
ferent numbers of alternatives were used, and in 
every condition all alternatives occurred equally 
often. But the probability of the occurrence of 
a particular alternative depended upon the 
immediately preceding alternative in a manner 
shown in Table 2. Condition 1, for example, 
consisted of two alternatives. Whenever one of 
these alternatives occurred, it would be followed 
by the other alternative 8/10 of the time and by 
itself 2/10 of the time. In Cond. 3, with four 
alternatives, the occurrence of a stimulus meant 
that the probability of its recurring on the next 
trial was 7/10; the probability of some other 
alternative occurring was 1/10. Conditions 6, 
7, and 8 were constructed in the same manner as 
conditions in Exp. I with the exception that a 
particular stimulus was never followed by itself. 

The experimental sessions—An experimental 
session lasted 1 hr. Each S attended one session 
a day, five days a week, at the same hour each 
day. Three series (from three different condi- 
tions) were run during each session. Since each 
series consisted of 120-128 stimuli, each session 
yielded a total of 360-384 reaction times. For 
experimental convenience each series was split 
into two parts of 60-64 stimulieach. By desig- 
nating the three series as A, B, and C, the typical 
counterbalanced order of any session can be 
symbolized in this manner: Practice on A, A, 
Practice on B, B, Practice on C, C, 5-min. rest, 
C, B, A. 


RESULTS 


Experiment I.—When the amount 
of information per stimulus presen- 
tation was varied by varying the num- 
ber of equally probable stimulus 
alternatives from which E could choose 
the stimulus, the linear correlation 
between amount of information and 
reaction time was .991, .982, .980, and 
.979 for each of the four Ss. There 





t 
§ 
1 











REACTION TIME 191 


TABLE 2 


Tue Ericut Conpitions ror Experiment III anp THe Corresponpinc AMOUNTS OF 
INFORMATION IN Bits PER StimuLus PRESENTATION 











Num a - . ¥. 
Cond. of Lights meee | ~~ Logs 1/p od Tidgeuion 
a\x in 
1 2 pa(b) = 8/10 0.32 0.72 
pala) = 2/10 2.32 
2 3 Pala) = pele) = 1/10 3.32 0.92 
pa(b) = 8/10 0.32 
3 4 pala) = 7/10 0.51 1.36 
Palb) = pa(c) = pa(d) = 1/10 3.32 
4 4 pa(b) = 3/6 1.00 1.79 
pala) = we = pa(d) = 1/6 2.58 
5 8 pa(b) = 9/16 0.82 2.21 
Pa(a) = pa(c) = - pa(h) = 1/16 4.00 
6 3 pala) = 0 0.00 1.00 
Palb) = pale) = 1/ 1.00 
7 5 Pala) = 0.00 2.00 
pa(b) = Palc) = ° Pale) = 1/4 2.00 
8 8 Pala) = 0 0.00 2.81 
Palb) = pa(c) = - Palh) = 1/7 2.81 

















was no systematic tendency for the 
variances of reaction times within each 
condition to be correlated with the 
means of the conditions. 

Three Ss showed a slight, but statis- 
tically significant, practice effect from 
series to series. For G. C. and F. P. 
this practice effect accounted for only 
3% of the total variance among series. 
For L. S., however, this practice effect 
amounted to 11.5% of the total vari- 
ance among his 40 series. This fact 
will be significant in later interpreta- 
tion of some results concerning this 
S’s regression lines. 

As was to be expected, F. K., who 
had a month’s preliminary training on 
Exp. I as part of the pilot study, 
showed no practice effect from series 
to series. 

It is interesting to note that Merkel 
(2) and later Hick (1) also obtained 
reaction time as a function of the 
number of stimulus alternatives. Both 
experimenters used finger keys rather 
than voice keys. When Merkel’s data 
are converted into reaction time as a 
function of the amount of information 
in the stimulus display, the linear 


regression accounts for 99% of the 
variance among his reaction-time 
means. Hick’s graphed points indi- 
cate that the linear correlation between 
reaction and information is approxi- 
mately, of the same magnitude as in 
the prgsent study. 

Exgeriment II.—The linear correla- 
tion between reaction time and infor- 
mayion when the amount of informa- 
igh was varied by varying the relative 
fyequency of occurrence of the various 

timuli with respect to each other was 

.989, .965, .994, and .952 for each of 
the four Ss. As in Exp. I, the vari- 
ances for each condition were not 
systematically related to the means. 
No practice effect was evident. 

Experiment ITI.—When the amount 
of information was varied by introduc- 
ing sequential dependencies among 
the successive stimulus presentations, 
the linear correlation between reaction 
time and information for each of the 
four Ss was .972, .965, .874, and .934, 
respectively. These correlations are 
systematically lower than those for the 
previous two experiments. In all four 
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cases the nonlinear variance is signifi- 
cant at the 1% level. 
The first five conditions in Table 2 


produced reaction times which were 


significantl 


y lower than the reaction 


times for the latter three conditions 


600F- 





after the means were adjusted for the 
linear trend. Practically all of the 
nonlinear variance for each of the four 
Ss was produced by the one degree of 
freedom used to compare these two 
groups of conditions; the nonlinear 
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variance associated with the remain- 
ing five degrees of freedom was non- 
significant. 

The high reaction times for the 
three conditions wherein no immedi- 
ate repetition of a stimulus was allowed 
raise some interesting questions. On 
the basis of information theory, the 
restriction “‘no repetition” lowers the 
amount of information per stimulus 
presentation since it effectively de- 
creases the number of alternatives by 
one. Instead of lowering S’s reaction 
time, however, this restriction caused 
his reaction time to be higher than if 
no such restriction had been imposed. 

Comparison of the three experiments. 
—The second experimental hypothesis 
stated that the three experiments 
would produce identical regression 
lines. In the case of three Ss, the null 
hypothesis that the three regression 
lines do in fact coincide could not be 
rejected at the 5% level of confidence. 
For L. S., however, the null hypothesis 
had to be rejected at the 1% level of 
confidence. This lack of coincidence 
could not be attributed to a discrep- 
ancy among the three regression co- 
efficients; the difference was due to a 
systematic displacement of the regres- 
sion line for Exp. I upwards on the 
ordinate relative to the lines for Exp. 
II and III. 

This systematic displacement among 
the regression lines of L. S. can be 
attributed to the relatively large prac- 
tice effect which this S displayed on 
Exp. I. Most of the series for Exp. I 
were run during the first six weeks of 
the experimental period, whereas most 
of the series for Exp. II and III were 
run during the latter half of the experi- 
mental period. L. S., it will be re- 
called, showed a practice effect which 
accounted for 11.5% of the variance 
among his series for Exp. I, whereas he 
presumably had reached a plateau by 
the time he participated in most of the 


series of Exp. II and III (wherein he 
showed no practice effect). Conse- 
quently his regression line for Exp. I 
was displaced slightly upwards on the 
ordinate. 

The joint regression line.—The joint 
regression line fitted to the combined 
points of the three experiments pro- 
duced a linear correlation between 
reaction time and amount of informa- 
tion for each of the four Ss of .985, 
953, .955, and .938, respectively (see 
Fig. 1). In other words, when we 
vary information in three distinct 
ways over a limited range of from 0.00 
to 3.00 bits, a linear regression of 
reaction time on information accounts 
for as much as 97.0, 91.1, 91.1, and 
88.8% of the total variance among 
mean reaction times for the 24 experi- 


mental conditions for each of the four 
Ss. 


Discussion 


So far this paper has presented em- 
pirical relationships which suggest 
that reaction time can be considered 
a linear function of stimulus informa- 
tion within the range of 0.00 to 3.00 
bits. More important than the shape 
of this function are the factors which 
operate to bring it about. This par- 
ticular study was set up to discover 
the type of relationship which exists 
rather than to delve into the causes of 
this relationship. Nevertheless, an 
examination of the data suggests some 
factors which may partially account 
for the resultant function. 

Nonadditive combination of com- 
ponents within conditions.—The con- 
ditions for Exp. II and III can be 
considered as made up of two or more 
components, each component having 
a different amount of information asso- 
ciated with its presentation (see 
Tables 1 and 2). Condition 1 in 
Table 1, for example, has two com- 
ponents, one with 0.15 bits and the 
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other with 3.32 bits. On the hypoth- 
esis that reaction time behaves in a 
manner analogous to the measure of 
information, we would expect the 
mean reaction time to each of :the 
components within a condition to fall 
on the regression line which was fitted 
to the over-all means of the conditions. 
To illustrate, one might expect a con- 
stant mean reaction time to an event 
which occurs with probability 4 re- 
gardless of how many other elements 
there are in the series. But such was 
not the case. In both Exp. II and III 
the components within a condition 
interacted with each other in such a 
way that the reaction time to the low 
information component was higher, 
and the reaction time to the high in- 
formation component was markedly 
lower, than would be predicted on the 
basis of the regression line fitted to 
the means of the conditions. This 
effect was very marked and occurred 
without a single exception for all four 
Ss in both Exp. II and III. 

An example will illustrate the nature 
of this effect. Take the condition in 
Table 1 (Cond. 3) which has 0.99 bits 
associated with it. For G. C., the 
regression line fitted to the reaction 
times of the 24 experimental condi- 
tions would predict a mean reaction 
time for this condition of 363 msec. 
The observed mean reaction time for 
this condition was 361 msec., a figure 
which agrees closely with the expected 
value. This condition consisted of 
two components, one with 0.30 bits of 
information and the other with 4.00 
bits of information attached to its 
occurrence. On the basis of the over- 
all regression line to G. C.’s points we 
would predict mean reaction times for 
these two conditions of 258 and 824 
msec., respectively. If we weight 
each of these expected values by the 
probability of occurrence of the com- 
ponent with which it is attached, 


13/16 and 3/16, respectively, our 
weighted predicted average comes out 
to be 363 msec. The observed mean 
reaction time for the component with 
0.30 bits was 306 msec., which was 48 
msec. higher than the predicted value, 
and the observed mean reaction time 
to the component with 4.00 bits was 
585 msec. or 235 msec. lower than the 
predicted value. Yet, when we weight 
these component means by their fre- 
quency of occurrence, we come up 
with an observed mean of 361 msec., 
agreeing very closely with the pre- 
dicted mean. Such a situation holds 
for all the conditions and their com- 
ponents in Exp. II and III and for all 
four Ss. 

These analyses mean that we can- 
not predict, on the basis of the regres- 
sion line fitted to the means of the 
conditions, what the mean reaction 
times will be to the components which 
make upacondition. If, however, we 
know what the components of the 
condition are, we can predict what the 
combined mean for this condition will 
be on the basis of the over-all regres- 
sion. The components, of course, do 
not combine additively. But they 
interact in such a manner that the 
condition means behave as if the 
components combined additively. If 
we are interested only in the behavior 
of the condition means, the assump- 
tion of additive combination of the 
components will serve our purposes. 
If, however, we are interested in the 
behavior of the components making 
up the conditions, we must find differ- 
ent laws and equations. 

Reaction time as a function of the 
number of other stimuli intervening 
between successive occurrences of a 
stimulus.—Another interesting factor 
determining the reaction time was 
first observed during the pilot study. 
The E observed that whenever a 
stimulus was immediately followed by 
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itself in the series, S seemed to respond 
unusually fast to it. This effect was 
apparently independent of S’s verbal- 
ized expectancy of the stimulus for he 
reported that in such cases he was not 
expecting the stimulus to follow itself. 
An examination of the data showed 
that this phenomenon was quite 
marked for the situation with four or 
more alternatives and steadily de- 
clined until it disappeared or became 
slightly negative for the case with just 
two alternatives. | 

As can be seen in Fig. 2, the experi- 
mental data from the four Ss support 
the observations made from the pilot 
study. In this figure reaction time is 
plotted as a function of the number of 
stimuli intervening between succes- 
sive occurrences of a particular stim- 
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Fic. 2. Reaction time as a function of the 
number of other stimuli intervening between 
successive occurrences of a particular stimulus 
within a series. Each point represents an aver- 
age of the data from four Ss. The three param- 
eters are the conditions with eight, four, and two 
alternatives from Exp. I. 


ulus in the series. The three param- 
eters plotted are the conditions with 
two, four, and eight alternatives from 
Exp. 1. Although the data are aver- 
aged over the four Ss, the individual 
curves differed from each other only 
in displacement along the ordinate. 

For more than two alternatives the 
general trend of the functions seems 
to be parabolic; the function seems to 
reach a maximum at around a dis- 
placement of one or two stimulus 
presentations and then come down 
again. The effect is more marked the 
greater the number of alternatives. 
Presumably two factors are operating 
to produce this bow-shaped curve. 
One is S’s verbalized introspection 
that a stimulus which has not appeared 
for some time in the series is reacted 
to more quickly than ordinarily be- 
cause of the greater expectancy now 
attached to its appearance. This 
“verbal expectancy” apparently ac- 
counts for the fact that the reaction 
times begin to get lower than the 
maximum for large displacements. 
The second factor seems to consist of 
some sort of residual effect produced 
by just having seen and reacted to a 
particular stimulus; this effect seems 
to facilitate reaction to this stimulus 
if it reappears within a finite time 
interval. For eight alternatives this 
facilitation seems to last for at least a 
displacement of one stimulus presen- 
tation. For two alternatives this 
facilitation does not affect the func- 
tion; perhaps it is at its maximum 
throughout the series and therefore 
does not show in the function for two 
alternatives. 

Evidence for periodicity—The data 
suggest a third factor which may have 
to be taken into account in the final 
story concerning the role of stimulus 
information as a determinant of reac- 
tion time. Separate frequency distri- 
butions were made of the reaction 
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times in each condition. For the 
situations wherein we have two or 
more alternatives the distributions 
tended to be multimodal. In almost 
every distribution a peak, clearly dif- 
ferentiated from the remainder of the 
distribution, regularly occurred at 
approximately S’s simple reaction 
time. The remainder of the distribu- 
tion in each case, however, did not 
present such a consistently clear pic- 
ture. The variability of the data was 
such that it is impossible to make any 
definite conclusions about the number 
of peaks and their location on the 
abscissa. Furthermore, if there is any 
natural periodicity in the distribution 
it seems to have been confounded with 
a periodicity suggested by the group- 
ing of stimuli on the display. Since 
the distributions seem to support sev- 
eral possible and different models as 
to what is happening, further discus- 
sion must be postponed until more 
data are available. 


SUMMARY 


The reaction time to a visual stim- 
ulus was investigated as a function of 
the amount of information conveyed 
by that stimulus. The amount of 
information in the stimulus was varied 
by varying (a) the number of equally 


probable alternatives from which it 
could be chosen, (b) the proportion of 
times it could occur relative to the 
other possible alternatives, and (c) 
the probability of its occurrence as a 
function of the immediately preceding 
stimulus presentation. 

The reaction time to the amount of 
information in the stimulus produced 
a linear regression for each of the three 
ways in which information was varied. 

The three regression lines obtained 
by the three separate ways of varying 
the amount of information were found 
to coincide for three Ss. In the case 
of the fourth S, a systematic displace- 
ment of one of his regression lines was 
attributed to a relatively large prac- 
tice effect which he showed for that 
experiment. 


(Received for early publication 
November 20, 1952) 
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CHANGES IN EXPERIMENTALLY PRODUCED ANXIETY 
WITH THE PASSAGE OF TIME: INCUBATION EFFECT? 


DALBIR BINDRA AND LOIS CAMERON 
McGill University 


People usually say the easiest way 
to overcome the fear of riding a horse, 
after taking a fall from it, is to remount 
the horse as soon after the painful 
experience as possible. The principle 
implied in this maxim is that fear or 
anxiety associated with a particular 
situation increases with the time which 
elapses between the original painful 
experience and the first re-exposure to 
that situation. That is, within limits, 
the longer one remains away from a 
situation that has proved painful, the 
greater the anxiety evoked on re-expo- 
sure to that situation. This paper is 
concerned with one of the many exper- 
imental problems suggested by this 
principle of layman’s psychology. 


An experiment conducted by Diven (1) about 
15 years ago provides a suitable starting point 
for further defining our problem. Diven used 
galvanic skin response (GSR) as an indicator of 
anxiety. In his investigation the word “barn,” 
occurring a number of times in a long list of 
words, was followed 12 sec. later by a strong 
electric shock. At the end of this training Ss 
gave a consistently high GSR to the signal 
“barn.” Then some of the Ss were given a 
5-min. rest period before being tested for reaction 
to the word list, including the signal “barn”; 
other Ss were given longer rest intervals (# hr., 
24 hr., and 48 hr.) before the test. During the 
test trials Ss in the }-hr. to 48-hr. groups gave 
significantly greater GSR’s than Ss in the 5-min. 
group; that is, the groups that had been away 
from the anxiety signal longer showed greater 
anxiety as indicated by changes in palmar con- 
ductance. This phenomenon has been termed 
incubation of anxiety. 


1This study was supperted by a research 
grant (AP-12) from the National Research Coun- 
cil of Canada. Some of the data reported here 
were first presented at the 1952 annual meeting 
of the Canadian Psychological Association in 
Banff, Alberta. 


Unfortunately a simple, unequivocal 
interpretation of Diven’s results is 
dificult. For one thing, Diven did 
not control the activity of the Ss dur- 
ing the rest period, and Haggard (2) 
has shown that conditions or experi- 
ences during the rest interval are sig- 
nificant in determining the extent of 
postrest anxiety. For this, and a 
number of other reasons, too involved 
to be discussed here, Diven’s experi- 
ments cannot be said to have demon- 
strated beyond doubt that the length 
of the rest interval as such was a sig- 
nificant variable in producing his 
results. 

The purpose of our experiment was 
to control the factors that had been 
left uncontrolled by Diven in an effort 
to see whether or not there is any 
genuine incubation of anxiety. Spe- 
cifically, our problem was to find out 
if the anxiety or apprehension evoked 
by an experimental anxiety signal, 
presented repeatedly, is affected by a 
rest interval, during which S remains 
shielded from the signal. In experi- 
mental terms, we were interested in 
seeing whether or not there is any dif- 
ference between the prerest and the 
postrest responses to the repeated pre- 
sentations of an anxiety signal. The 
general plan of the experiment involved 
measuring changes in palmar skin con- 
ductance brought about by signals 
presented at regular intervals before 
and after a rest period of 10 min. 
Changes in palmar conductance, as 
well as the extent to which Ss were 
able to discriminate (a) between two 
types of signals, and (b) between the 
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presignal and postsignal periods, were 
considered as indicators of anxiety. 

To appraise the effects of any local 
physiological changes at the point of 
contact between the electrodes and 
the skin, two groups of Ss were used. 
In the case of the electrodes-off group, 
the electrodes were removed for the 
duration of the rest, and were reat- 
tached after the rest interval; the 
electrodes-on group had the electrodes 
on throughout the experimental ses- 
sion, including the rest period. 


MeETHOD 


Apparatus.—Three pieces of equipment were 
used in this experiment: (2) A memory drum, 
controlled by £, exposed the"signals to S. (b) 
A Lafayette polygraph was employed to obtain 
continuous records of changes in palmar conduc- 
tance. The electrodes were rather large, with 
surface area of roughly 8 sq. cm. each, and were 
well coated with{electrode; jellyjbeforefuse. (c) 
A small inductorium, connected to two clamp- 
electrodes, enabled E to give S a single, mild, 
faradic shock (roughly 1 milliampere peak) 
through two fingers of his right hand. 

The memory drum was'placed in front of the 
S, while the rest of the equipment was placed 
behind a screen and could not be seen by him. 

Subjects.—Eighteen Ss, 11 men and 7 women, 
volunteered for this experiment. Their ages 
ranged from 18 to 36 years. Seven of these Ss 
were in the electrodes-off_group, and 11 in the 
electrodes-on group. The purpose of the experi- 
ment was not known to the Ss. 

Procedure.—The S was seated comfortably at 
a table and was given the following instructions: 
“This is an experiment to test the effect of 
various environmental conditions on intellectual 
performance. I am sorry that I cannot discuss 
the experiment with you in advance, but you 
can understand that to do so might affect the 
results. Won’t you sit down here, please, while 
I put the apparatus on? (Fasten electrodes to 
the left hand; insert two fingers of the right hand 
in the shock-electrodes; balance the circuit and 
note normal conductance.) Now, what I want 
you to do is to look into the slot on this board 
(point to the memory drum) when I say, ‘Ready,’ 
and keep your eyes on it steadily throughout the 
experiment. Then I will press a button which 
will bring a number into view. When you see 
the number, repeat it out loud, and start count- 
ing from it in twos as quickly as possible (e.g., 
631-633-635). After a short while (15 sec., see 
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START SIGNAL | B,V BUZZER, SHOCK 
| 
) ) | 
\s 13 SEC iS SEC - 
I rr ur w 
Fic. 1. Plan of each trial. Each trial 


started with the S counting in 2’s from a given 
number (751-753-755, etc.). Fifteen seconds 
later, he was presented with a signal, B or V. 
He continued counting for another 15 sec., and 
at the end of that time was either given a shock 
(Signal V) or exposed to a buzzer (Signal B). 
Numbers I, II, III, and IV indicate the points 
during the trial for which S’s palmar conductance 
was calculated. 


Fig. 1), I will press the button again, and a letter 
will appear in the slot. It will be either B or V. 
You are to continue counting. If the letter is 
B, you know that in a short while (15 sec.) you 
will hear a buzz. If V, you will hear the buzz, 
and it will be accompanied by a vibration (shock) 
in your fingers. Regardless of this, you are to 
continue counting until the letter disappears. 
Do you understand? Very well, we will have a 
couple of practice trials.” 

After the two practice trials, S was given 12 
trials. On six of these trials S was shown the 
signal B (for buzzer, or no-shock), and on the 
other six he was shown the signal V (for vibra- 
tion, or shock). These nonshock and shock trials 
were given in the same order (BVBVVB) both 
before and after the rest period. The rest inter- 
val of 10 min. was interpolated between Trials 
6 and 7, and S was asked to “look through” a 
copy of the New Yorker magazine during the rest 
period. In case of the electrodes-off group, elec- 
trodes were removed for the duration of the rest. 
Excepting the rest period, roughly 1 min. elapsed 
between the end of one trial and the beginning 
of the next. The plan of each trial is presented 
in Fig. 1. 

Three points should be noted in connection 
with this procedure. First, all Ss were aware 
(since they were told) that the signal for shock 
would be V, and the signal for no-shock, B. 
This factor of awareness has been shown to be an 
important variable in experiments of this kind 
(1,2). Second, a partial control of S’s activity 
during the experiment was effected by asking 
him to count in 2’s during the trials, and by 
having him look through the New Yorker during 
the rest interval. Third, the postrest trials were 
exactly like the prerest trials. For this reason, 
postrest trials cannot be considered “‘test trials” 
inasmuch as Signal V was followed by shock in 
these postrest trials as well. This procedure was 


adopted as a precaution against radically altering 
the expectations of Ss by discontinuing shock, 
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and thus introducing, what must appear to them, 
a major experimental change. The cognitive 
consequences of “no more shock” can be of fun- 
damental importance in an experiment of this 
kind. 

The data.—Deflections of the galvanometer- 
pen, measured in centimeters, were converted 
into resistance units (ohms) with the help of 
calibration charts. Resistance was then con- 
verted into conductance in micromhos by the 


formula 5 X 108(5). In addition to the normal 


conductance obtained at the beginning of the 
experiment, before S received instructions, con- 
ductance was determined at four points during 
each trial (see Fig. 1): I. Conductance at the 
beginning of each trial, called pretrial conduc- 
tance. II. Maximum conductance during the 
first 15 sec. (i.e., during the 15 sec. preceding the 
signal B or V), called presignal conductance. 
III. Conductance at the time S was shown the 
signal, called conductance-at-signal. IV. Maxi- 
mum conductance during the 15 sec. between the 
signal and the buzzer (or the shock), called post- 
signal conductance (this does not involve the 
reaction to the shock or the buzzer). 

From these four measures of palmar conduc- 
tance three derived measures were obtained for 
each trial: (a) presignal conductance change: 
maximum change in conductance during the 
presignal period (i.e., II-I; see the preceding 
paragraph); (b) postsignal conductance change: 
maximum change in conductance during the 
postsignal period (i.e., IV-III); (c) over-all con- 
ductance change: { difference} between {the post- 
signal and pretrial conductances (i.e., IV-I). 

Since change in conductance was not restricted 
to the postsignal period (see Table 2) it was felt 
that the third measure, over-all conductance 
change, best represented the reaction to the 
signal. 

Statistical treatment of the data.—In view of 
the many doubts that have been cast on the 
applicability of the standard statistical proce- 
dures to conductance scores (e.g., 3, 5), it was 
decided to use: (a) the median instead of the 
mean as a measure of central tendency or aver- 
age; (b) tests of significance that were based on 
rank ordering and did not assume any particular 
type of distribution of scores (6). 


RESULTS 


Electrodes-on and __ electrodes-off 
groups.—All conductance change val- 
ues represent increase in palmar con- 
ductance. After the rest interval, the 
electrodes-off group, as compared to 


the electrodes-on group, tended to 
show higher (median) pretrial conduc- 
tance as well as higher (median) con- 
ductance change. Neither of these 
differences was statistically significant. 
Partly for this reason, and partly for 
the pattern of the observed differ- 
ences, it was decided that the two 
groups were equivalent, and they were 
combined into one group of 18 Ss in 
all further analyses. 

Pretrial conductance.—Since the suc- 
cessive trials were roughly equally 
spaced, the 12 pretrial conductance 
scores can be looked upon as repre- 
senting the long-term changes in con- 
ductance during the experimental 
session. The lower curve in Fig. 2 
shows the fluctuations in pretrial con- 
ductance over the entire experimental 
period. The first point on the graph 
represents normal conductance (deter- 
mined before the subjects received 
instructions). It is seen that the gen- 
eral level of conductance increases 
steadily throughout the experimental 
session, going from about 50 microm- 
hos to roughly 125 micromhos. A 
major proportion of this increase takes 
place within the first half of the session, 
i.e., before the rest period. However, 
not all Ss showed this steady rise 
throughout the session; rather, each S 
presented a more or less unique pat- 
tern of fluctuations in conductance, a 
pattern that may be significant in 
studies of individual differences. 

There is no sudden rise or drop in 
conductance after the rest interval, 
though from these data we cannot say 
whether or not any major changes 
took place during the rest interval. 

Over-all conductance change.—The 
upper curve in Fig. 2 shows that the 
median over-all conductance change 
for the trials after the rest period is in 
general greater than the median over- 
all conductance change for the corre- 
sponding trials before the rest interval. 
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TABLE 1 


Mepian Over-aALL ConpucTaNceE CHANGE 
(Increase, IV-I) 1n Micromuos For 
SHock AND NonsuHock TRIALS DURING 
THE PREREST AND PosTREST 


























PERIODS 
Nonshock Trials} Shock Trials 
Condition 
1 2 | 3 1 | 2 3 
Before rest 6.9| 3.5 | 3.1) 10.7) 9.4) 8.7 
After rest 14.2] 7.5 | 13.4] 15.7| 15.1} 8.0 
Significance of 
the difference* p<.01 p<.10 











* The median values for the three prerest and the 
three postrest nonshock trials were calculated for each 
S; then the prerest and the postrest medians were com- 
age by Wilcoxon's rank-difference test (6). The data 
7 > shock trials were treated in the same way. 


In other respects the two curves are 
remarkably similar. Table 1 presents 
the same results in another way. It 
shows that for both the shock and the 
nonshock trials conductance change is 


greater after the rest interval than 
before it. The difference in conduc- 
tance change between the pre- and 
postrest nonshock trials is significant 
at the 1% level, but the difference 
between the pre- and postrest shock 
trials, though in the same direction, 
does not meet our criterion of signifi- 
cance (p <.05). In this connection 
we should remember that conductance 
change in case of shock trials probably 
comes close to its physiological limit, 
and this may attenuate the true dif- 
ference between the pre- and postrest 
trials. 

Discrimination.—Table 1 also shows 
that Ss discriminated between the two 
signals, B and V. Over-all conduc- 
tance was greater for Signal V (shock) 
than for B (no-shock). This discrim- 
ination was to be expected in view of 
the fact that Ss had been told in 
advance the significance of the two 
signals. It is interesting to note, how- 
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Pretrial conductance (lower curve; left ordinate), and over-all conductance change: 
(upper curve; right ordinate) for the 12 successive trials. 
signal used (B = buzzer, V = shock) on any particular trial. 


Letters B and V indicate the type of 
Point X represents the normal 


conductance, obtained at the beginning of the experiment. 
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TABLE 2 


MEDIAN PRESIGNAL AND PostsiGNAL ConpucTANCE CHANGE (INCREASE) In MicromHos 
DURING THE PREREST AND Postrest PERIODS 





















































Prerest Trials Postrest Trials 
1 2 3 4 5 6 7 8 9 10 il 12 
Presignal 
(II-I) 15.3 | 11.2 | 5.1 | 6.5 | 7.1 | 7.2 | 9.7 | 10.7 | 16.4] 8.7} 11.1 | 14.0 
Postsignal 
(IV-III) 2.9) 5.9] 3.2] 41) 54]19 | 3.2 | 91} 2.0] 10.2] 11.8] 5.4 
Significance of the 
difference* p<.0l p<.05 











* The median values for the six presignal and the six postsignal prerest trials were calculated for each S; then 
the presignal and the postsignal medians were compared by Wilcoxon's rank-difference test (6). The data for the 


postrest trials were treated in the same way. N = 18. 


ever, that the difference between the 
shock and nonshock trials appears to 
be not as clear-cut after the rest 
period as it is before rest. This 
result, though statistically not signifi- 
cant, is in the expected direction, and 
in view of the previously demonstrated 
positive relation between anxiety and 
lack of discrimination (4), suggests 
greater anxiety during the postrest 
trials than during the prerest trials. 

A comparison of conductance 
changes during the presignal and post- 
signal periods of each trial (see Fig. 1) 
provides another indicator of the ex- 
tent of discrimination. Contrary to 
what might be expected, Ss showed 
(Table 2) remarkably consistently a 
greater presignal conductance change 
(II-I) than postsignal conductance 
change (IV-III). This result would 
suggest that Ss became apprehensive 
even before the appearance of the 
signal (B or V), that is, they showed a 
major part of the increase in conduc- 
tance before they knew whether they 
were going to receive a shock or not; 
then, if the signal turned out to be V, 
their conductance increased further, 
otherwise (signal B) it decreased 
somewhat or, at any rate, did not in- 
crease any further. It may not be 


quite correct to use the word “dis- 
crimination”’ to refer to this difference 
between the pre- and postsignal peri- 
ods: these results seem to stem from 
lack of discrimination, or from unnec- 
essary generalization. For this reason, 
extent of discrimination in this case 
cannot be used as an indicator of 
anxiety. 


Discussion 


The purpose of this experiment was 
to see if the anxiety (increase in pal- 
mar conductance) evoked by a peri- 
odically repeating signal is increased 
by the interpolation of an interval of 
rest, during which S is not exposed to 
the signal. Conductance change and 
lack of discrimination were used as the 
indices of anxiety. The results show 
clearly that the over-all conductance 
change evoked by the experimental 
signals is greater after the 10-min. rest 
interval than before it. This fact is 
all the more striking if we take into 
account the effect of adaptation, shown 
by the gradual decrease in conduc- 
tance change, in successive trials 
(Table 1). That is, the true increase 
in conductance change brought about 
by the rest period is probably greater 
than that shown in the experimental 
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results. The data on discrimination, 
though not statistically significant, 
are also consistent with the conclusion 
that anxiety, as measured by conduc- 
tance change, was greater after the 
rest interval. 

It must be recalled that the shock 
followed Signal V even in the postrest 
trials. Thus, in fact, Ss had had a 
larger number of V-shock presenta- 
tions by the end of the postrest period 
than by the end of the prerest period. 
Therefore, it can be argued that the 
greater conductance change in the 
postrest trials was due only to an in- 
crease in the strength of association 
between Signal V and the shock. Such 
an interpretation would be plausible 
only if it could be shown that after 
each. shock there was an increase in 
Ss’ reactions to the signals. This is 
not the case. On the contrary, the 
reaction (conductance change) seems 
to decline in successive trials, both in 
the pre- and the postrest periods. 


Now what exactly is the nature of this effect 
produced by the rest interval? Let us look at 
a number of alternatives: 

Is the effect due to recovery from adaptation? 
It is possible that gradual adaptation of the 
response took place during the first six, prerest, 
trials, and that recovery from adaptation during 
the rest interval showed itself in a higher con- 
ductance change in the postrest trials. This 
interpretation is not plausible. Recovery from 
adaptation may be expected to bring the level of 
response (conductance change) back to prerest 
level, but it cannot be expected to “recover” 
beyond that level. Our results show that the 
response (Table 1) in the postrest trials is greater 
in absolute value than the response on the corre- 
sponding trials before rest. The recovery-from- 
adaptation hypothesis must be rejected. 

Is the effect due to local physiological changes 
connected with the electrodes and the palm? It 
could be argued that the accumulation of perspi- 
ration during the prerest period somehow 
changed the “contact” between the electrodes 
and the two surfaces of the hand, thus causing 
differences between the pre- and postrest 
responses. If this were the case, we would ex- 


pect some difference in the reactions of the two 
groups: Ss who did not have electrodes on during 
the rest interval should have reacted differently 
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from those who retained them on the hand during 
this interval. Our results show no significant 
difference between the two groups. Thus the 
accumulation-of-perspiration hypothesis must 
also be rejected. 

If the phenomenon we are considering is not 
due either to local physiological artifacts or to 
simple recovery from adaptation, we are prob- 
ably safe in calling it incubation effect, and attri- 
buting it to some other factors associated with 
the rest interval, or with the passage of time 
as such. Now, we can raise some other ques- 
tions concerning the nature of this incubation 
effect. 

Is the incubation effect peculiar to situations 
involving fear or anxiety? Or is it a general 
phenomenon occurring in reactions to nonanxiety 
stimuli as well? This effect has not been 
reported in nonanxiety situations. When the 
usual auditory and visual stimuli are used, and 
no threat of shock is involved, conductance 
change declines in successive trials, and this 
adaptation gets easier in successive experi- 
mental sessions (7). ‘Thus, it is possible that the 
incubation effect is peculiar to situations involv- 
ing some degree of apprehension, though this 
point should be checked by direct experimental 
evidence. 

Other questions that require consideration 
are: (a) Does the incubating process take place 
in the peripheral organs involved or in the cen- 
tral nervous system? (5) Is the process respon- 
sible for incubation a continuous one, going on 
throughout the rest period, or does it set in after 
the rest interval, at the time of the first postrest 
trial? (c) What exactly is the significance of 
the rest period? Is it to be considered purely as 
an interval of time, or does its effect depend upon 
some associated expectations and other cognitive 
factors? Answers to these questions will pave 
the way for formulating a theory to deal with 
the incubation effect. 


SUMMARY 


The purpose of this study was to 
determine the effect on experimentally 
produced anxiety (increase in palmar 
conductance) of a rest interval, during 
which S remains shielded from the 
anxiety signal. Eighteen Ss were 
exposed to a series of experimental 
signals, once before and once after a 
rest period of 10 min. The anxiety 
shown in the trials after the rest period 
was significantly greater than anxiety 
in the trials before rest. Discrimina- 
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tion between the shock and nonshock 
signals also tended to be better before 
the rest interval. This increase in anx- 
iety reaction brought about by the 
interpolated rest interval cannot be 
attributed either to recovery from ad- 
aptation or to local physiological 
changes. It appears to be a genuine 
incubation effect, but its exact nature 
is not clear. 


(Received August 11, 1952) 
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